DOI: 10.1145/3592533.3592806
research-article
Open Access

What goes wrong in serverless runtimes? A survey of bugs in Knative Serving

Published: 8 May 2023

ABSTRACT

Serverless runtime systems are complex software artifacts and difficult to make reliable. We present a large-scale empirical study of bugs in serverless runtimes, in the context of the popular open-source Knative Serving serverless platform. We analyze issues reported against Knative Serving over a three-year period and identify broad trends. Our findings shed light on the challenges of building correct, efficient serverless runtimes and suggest fruitful directions for further research.


Published in

SESAME '23: Proceedings of the 1st Workshop on SErverless Systems, Applications and MEthodologies
May 2023
64 pages
ISBN: 9798400701856
DOI: 10.1145/3592533

Copyright © 2023 Owner/Author(s)

This work is licensed under a Creative Commons Attribution International 4.0 License.

Publisher: Association for Computing Machinery, New York, NY, United States

