ABSTRACT
Serverless runtime systems are complex software artifacts and difficult to make reliable. We present a large-scale empirical study of bugs in serverless runtimes, in the context of the popular open-source Knative Serving serverless platform. We analyze issues reported against Knative Serving over a three-year period and identify broad trends. Our findings shed light on the challenges of building correct, efficient serverless runtimes and suggest fruitful directions for further research.
- Sebastian Burckhardt, Chris Gillum, David Justo, Konstantinos Kallas, Connor McMahon, and Christopher S. Meiklejohn. 2021. Durable Functions: Semantics for Stateful Serverless. Proc. ACM Program. Lang. 5, OOPSLA, Article 133 (oct 2021), 27 pages. Google ScholarDigital Library
- CNCF. 2019. CNCF WG-Serverless Whitepaper v1.0. Technical Report. Cloud Native Computing Foundation.Google Scholar
- Datadog. 2022. The State of Serverless. https://www.datadoghq.com/state-of-serverless/Google Scholar
- Abhinav Jangda, Donald Pinckney, Yuriy Brun, and Arjun Guha. 2019. Formal Foundations of Serverless Computing. Proc. ACM Program. Lang. 3, OOPSLA, Article 149 (oct 2019), 26 pages. Google ScholarDigital Library
- Nima Kaviani, Dmitriy Kalinin, and Michael Maximilien. 2019. Towards Serverless as Commodity: A Case of Knative. In Proceedings of the 5th International Workshop on Serverless Computing (Davis, CA, USA) (WOSC '19). Association for Computing Machinery, New York, NY, USA, 13--18. Google ScholarDigital Library
- The Knative Authors. 2022. Knative Runtime Contract. https://github.com/knative/specs/blob/main/specs/serving/runtime-contract.mdGoogle Scholar
- The Knative Authors. 2023. Knative. https://knative.dev/docs/.Google Scholar
- The Kubernetes Authors. 2023a. Kubernetes. https://kubernetes.io/.Google Scholar
- The Kubernetes Authors. 2023b. Pod Lifecycle. https://kubernetes.io/docs/concepts/workloads/pods/pod-lifecycle/#pod-conditionsGoogle Scholar
- Junfeng Li, Sameer G. Kulkarni, K. K. Ramakrishnan, and Dan Li. 2019. Understanding Open Source Serverless Platforms: Design Considerations and Performance. In Proceedings of the 5th International Workshop on Serverless Computing (Davis, CA, USA) (WOSC '19). Association for Computing Machinery, New York, NY, USA, 37--42. Google ScholarDigital Library
- Anup Mohan, Harshad Sane, Kshitij Doshi, Saikrishna Edupuganti, Naren Nayak, and Vadim Sukhomlinov. 2019. Agile Cold Starts for Scalable Serverless. In 11th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 19). USENIX Association, Renton, WA. https://www.usenix.org/conference/hotcloud19/presentation/mohanGoogle Scholar
- Sunil Kumar Mohanty, Gopika Premsankar, and Mario di Francesco. 2018. An Evaluation of Open Source Serverless Computing Frameworks. In 2018 IEEE International Conference on Cloud Computing Technology and Science (CloudCom). 115--120. Google ScholarCross Ref
- Rian Shambaugh, Aaron Weiss, and Arjun Guha. 2016. Rehearsal: A Configuration Verification Tool for Puppet. In Proceedings of the 37th ACM SIGPLAN Conference on Programming Language Design and Implementation (Santa Barbara, CA, USA) (PLDI '16). Association for Computing Machinery, New York, NY, USA, 416--430. Google ScholarDigital Library
- Xudong Sun, Runxiang Cheng, Jianyan Chen, Elaine Ang, Owolabi Legunsen, and Tianyin Xu. 2020. Testing Configuration Changes in Context to Prevent Production Failures. In Proceedings of the 14th USENIX Conference on Operating Systems Design and Implementation (OSDI'20). USENIX Association, USA, Article 42, 17 pages.Google ScholarDigital Library
- Xudong Sun, Wenqing Luo, Jiawei Tyler Gu, Aishwarya Ganesan, Ramnatthan Alagappan, Michael Gasch, Lalith Suresh, and Tianyin Xu. 2022. Automatic Reliability Testing For Cluster Management Controllers. In 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI 22). USENIX Association, Carlsbad, CA, 143--159. https://www.usenix.org/conference/osdi22/presentation/sunGoogle Scholar
- Jinfeng Wen, Zhenpeng Chen, Yi Liu, Yiling Lou, Yun Ma, Gang Huang, Xin Jin, and Xuanzhe Liu. 2021. An Empirical Study on Challenges of Application Development in Serverless Computing. In Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (Athens, Greece) (ESEC/FSE 2021). Association for Computing Machinery, New York, NY, USA, 416--428. Google ScholarDigital Library
- Tianyin Xu, Xinxin Jin, Peng Huang, Yuanyuan Zhou, Shan Lu, Long Jin, and Shankar Pasupathy. 2016. Early Detection of Configuration Errors to Reduce Failure Damage. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16). USENIX Association, Savannah, GA, 619--634. https://www.usenix.org/conference/osdi16/technical-sessions/presentation/xuGoogle Scholar
Recommendations
Serverless Computing: A Survey of Opportunities, Challenges, and Applications
The emerging serverless computing paradigm has attracted attention from both academia and industry. This paradigm brings benefits such as less operational complexity, a pay-as-you-go pricing model, and an auto-scaling feature. The paradigm opens up new ...
The Serverless Computing Survey: A Technical Primer for Design Architecture
The development of cloud infrastructures inspires the emergence of cloud-native computing. As the most promising architecture for deploying microservices, serverless computing has recently attracted more and more attention in both industry and academia. ...
Survey on serverless computing
AbstractServerless computing has gained importance over the last decade as an exciting new field, owing to its large influence in reducing costs, decreasing latency, improving scalability, and eliminating server-side management, to name a few. However, to ...
Comments