skip to main content
research-article
Free Access

Toward ML-centric cloud platforms

Published:22 January 2020Publication History
Skip Abstract Section

Abstract

Exploring the opportunities to use ML, the possible designs, and our experience with Microsoft Azure.

References

  1. Abadi, M. et al. TensorFlow: A system for large-scale machine learning. In Proceedings of the 12th USENIX Symp. Operating System Design and Implementation (2016).Google ScholarGoogle Scholar
  2. Agarwal, A. et al. Making contextual decisions with low technical debt. arXiv preprint arXiv:1606.03966 (2016).Google ScholarGoogle Scholar
  3. Amazon Web Services. Amazon CloudWatch; https://aws.amazon.com/cloudwatch/.Google ScholarGoogle Scholar
  4. Bodik, P., Griffith, R., Sutton, C., Fox, A., Jordan, M., and Patterson, D. Statistical machine learning makes automatic control practical for Internet datacenters. In Proceedings of HotCloud (2009).Google ScholarGoogle Scholar
  5. Calheiros, R. N., Masoumi, E., Ranjan, R., and Buyya, R. Workload prediction using ARIMA model and its impact on cloud applications' QoS. IEEE Trans. Cloud Computing 3, 4 (2015).Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Cao, R., Yu, Z., Marbach, T., Li, J., Wang, G., and Liu, X. Load prediction for data centers based on database service. In Proceedings of the 42nd Annual Computer Software and Applications Conf. (2018).Google ScholarGoogle ScholarCross RefCross Ref
  7. Chaiken, R., Jenkins, B., Larson, P., Ramsey, B., Shakib, D., Weaver, S., and Zhou, J. SCOPE: Easy and efficient parallel processing of massive data sets. In Proceedings of the 34th Intern. Conf. Very Large Data Bases (2008).Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Chen, S., Shen, Y., and Zhu, Y. Modeling conceptual characteristics of virtual machines for CPU utilization prediction. In Proceedings of the Intern. Conf. Conceptual Modeling (2018).Google ScholarGoogle ScholarCross RefCross Ref
  9. Cortez, E., Bonde, A., Muzio, A., Russinovich, M., Fontoura, M., and Bianchini, R. Resource central: Understanding and predicting workloads for improved resource management in large cloud platforms. In Proceedings of the Intern. Symp Operating Systems Principles (2017).Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Crankshaw, D. et al. The missing piece in complex analytics: Low latency, scalable model management and serving with Velox. In Proceedings of the 7th Biennial Conf. Innovative Data Systems Research (2015).Google ScholarGoogle Scholar
  11. Crankshaw, D., Wang, X., Zhou, G., Franklin, M. J., Gonzalez, J. E., and Stoica, I. Clipper: A low-latency online prediction serving system. In Proceedings of the 14th Symp. Networked Systems Design and Implementation (2017).Google ScholarGoogle Scholar
  12. Delimitrou, C. and Kozyrakis, C. Paragon: QoS-aware scheduling for heterogeneous datacenters. In Proceedings of the 18th Intern. Conf. Architectural Support for Programming Languages and Operating Systems (2013).Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Fox, A., Kiciman, E., and Patterson, D. Combining statistical monitoring and predictable recovery for self-management. In Proceedings of the 1st Workshop on Self-Managed Systems (2004).Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Gao, J. Machine Learning Applications For Datacenter Optimization, 2014.Google ScholarGoogle Scholar
  15. Gong, Z., Gu, X., and Wilkes, J. Press: Predictive elastic resource scaling for cloud systems. In Proceedings of the Intern. Conf. Network and Service Management (2010).Google ScholarGoogle Scholar
  16. Google. TensorFlow serving; http://tensorflow.github.io/serving/.Google ScholarGoogle Scholar
  17. Islam, S., Keung, J., Lee, K., and Liu, A. Empirical prediction models for adaptive resource provisioning in the cloud. Future Generation Computer Systems 28, 1 (2012).Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Ke, G. et al. LightGBM: A highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems 30 (2017).Google ScholarGoogle Scholar
  19. Khan, A., Yan, X., Tao, S., and Anerousis, N. Workload characterization and prediction in the cloud: A multiple time series approach. In Proceedings of the Intern. Conf. Network and Service Management (2012).Google ScholarGoogle ScholarCross RefCross Ref
  20. Mao, H., Alizadeh, M., Menache, I., and Kandula, S. Resource management with deep reinforcement learning. In Proceedings of the 15th ACM Workshop on Hot Topics in Networks (2016).Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Microsoft Azure. Azure Monitor; https://azure.microsoft.com/en-us/services/monitor/.Google ScholarGoogle Scholar
  22. Moritz, P. et al. Ray: A distributed framework for emerging AI applications. In Proceedings of the 13th USENIX Symp. Operating Systems Design and Implementation (2018).Google ScholarGoogle Scholar
  23. Novakovic, D., Vasic, N., Novakovic, S., Kostic, D., and Bianchini, R. DeepDive: Transparently identifying and managing performance interference in virtualized environments. In Proceedings of the USENIX Annual Technical Conf. (2013).Google ScholarGoogle Scholar
  24. Rao, J., Bu, X., Xu, C.-Z., Wang, L., and Yin, G. VCONF: A reinforcement learning approach to virtual machine auto-configuration. In Proceedings of the 6th Intern. Conf. Autonomic Computing (2009).Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Roy, N., Dubey, A., and Gokhale, A. Efficient autoscaling in the cloud using predictive models for workload forecasting. In Proceedings of the Intern. Conf. on Cloud Computing (2011).Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Yadwadkar, N.J. Machine learning for automatic resource management in the datacenter and the cloud. Ph.D. thesis, UC Berkeley, 2018.Google ScholarGoogle Scholar
  27. Zhang, Y., Prekas, G., Fumarola, G. M., Fontoura, M., Goiri, I., and Bianchini, R. History-Based harvesting of spare cycles and storage in large-scale datacenters. In Proceedings of the Intern. Symp. Operating Systems Design and Implementation (2016).Google ScholarGoogle Scholar
  28. Zheng, W., Nguyen, T.D., and Bianchini, R. Automatic configuration of Internet services. In Proceedings of the 2nd European Conf. Computer systems (2007).Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Toward ML-centric cloud platforms

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image Communications of the ACM
          Communications of the ACM  Volume 63, Issue 2
          February 2020
          80 pages
          ISSN:0001-0782
          EISSN:1557-7317
          DOI:10.1145/3380852
          Issue’s Table of Contents

          Copyright © 2020 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 22 January 2020

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Popular
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format .

        View HTML Format