skip to main content
research-article

Joint optimization of idle and cooling power in data centers while maintaining response time

Published:13 March 2010Publication History
Skip Abstract Section

Abstract

Server power and cooling power amount to a significant fraction of modern data centers' recurring costs. While data centers provision enough servers to guarantee response times under the maximum loading, data centers operate under much less loading most of the times (e.g., 30-70% of the maximum loading). Previous server-power proposals exploit this under-utilization to reduce the server idle power by keeping active only as many servers as necessary and putting the rest into low-power standby modes. However, these proposals incur higher cooling power due to hot spots created by concentrating the data center loading on fewer active servers, or degrade response times due to standby-to-active transition delays, or both. Other proposals optimize the cooling power but incur considerable idle power. To address the first issue of power, we propose PowerTrade, which trades-off idle power and cooling power for each other, thereby reducing the total power. To address the second issue of response time, we propose SurgeGuard to overprovision the number of active servers beyond that needed by the current loading so as to absorb future increases in the loading. SurgeGuard is a two-tier scheme which uses well-known over-provisioning at coarse time granularities (e.g., one hour) to absorb the common, smooth increases in the loading, and a novel fine-grain replenishment of the over-provisioned reserves at fine time granularities (e.g., five minutes) to handle the uncommon, abrupt loading surges. Using real-world traces, we show that combining PowerTrade and SurgeGuard reduces total power by 30% compared to previous low-power schemes while maintaining response times within 1.7%.

References

  1. AirPAK. Computational fluid dynamics (CFD) software by Ansys Inc. http://www.ansys.com/products/airpak/default.asp.Google ScholarGoogle Scholar
  2. O. Allen. Probability, statistics and queuing theory with computer science applications. 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. L. A. Barroso and U. Hölzle. The case for energy-proportional computing. IEEE Computer, 40(12):33--37, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. C. Belady. Green grid data center power efficiency metrics, PUE and DCIE. White paper: Metrics & Measurements. http://www.thegreengrid.org, 2007.Google ScholarGoogle Scholar
  5. P. Bohrer, E. Elnozahy, T. Keller, M. Kistler, C. Lefurgy, C. McDowell, and R. Rajamony. The case for power management in web servers. Power aware computing, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. G. Bolch, S. Greiner, H. Meer, and K. S. Trivedi. Queuing networks and markov chains: Modeling and performance evaluation with computer science applications. 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. S. Chase, D. C. Anderson, P. N. Thakar, A. M. Vahdat, and R. P. Doyle. Managing energy and server resources in hosting centers. In SOSP '01: Proceedings of the eighteenth ACM symposium on Operating systems principles, pages 103--116, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Y. Chen, A. Das, W. Qin, A. Sivasubramaniam, Q. Wang, and N. Gautam. Managing server energy and operational costs in hosting centers. In SIGMETRICS '05: Proceedings of ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, vol. 33, pages 303--314, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Choi, Y. Kim, A. Sivasubramaniam, J. Srebric, Q. Wang, and J. Lee. Modeling and managing thermal profiles of rack mounted servers with thermostat. In HPCA '07: High Performance Computer Architecture, pages 205--215, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Standard Performance Evaluation Corporation. http://www.spec.org/power_ssj2008/results/power_ssj2008.html.Google ScholarGoogle Scholar
  11. Linux Documentation. latest release 2.6, http://www.kernel.org/doc/documentation/power/states.txt.Google ScholarGoogle Scholar
  12. E. Elnozahy, M. Kistler, and R. Rajamony. Energy-efficient server clusters. In Proceedings of the 2nd Workshop on Power-Aware Computing Systems, pages 179--196, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. U.S. EPA. Report to congress on server and data center energy efficiency. In U.S. Environmental Protection Agency, Tech Report, 2007.Google ScholarGoogle Scholar
  14. X. Fan, W.-D. Weber, and L. A. Barroso. Power provisioning for a warehouse-sized computer. In 34th International Symposium on Computer Architecture, pages 13--23, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. J. Hamilton. Internet-scale service infrastructure efficiency. In ISCA '09 keynote, http://perspectives.mvdirona.com/2008/11/28/CostOfPowerInLargeScaleDataCenters.aspx, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. T. Heath, B. Diniz, E. V. Carrera, W. M. Jr., and R. Bianchini. Energy conservation in heterogeneous server clusters. In PPoPP '05: 10th ACM SIGPLAN symposium on Principles and practice of parallel programming, pages 186--195, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Traces in the Internet Traffic Archive. http://ita.ee.lbl.gov/html/traces.html.Google ScholarGoogle Scholar
  18. W. LeFebvre. CNN.com: Facing a world crisis. http://www.tcsa.org/lisa2001/cnn.txt.Google ScholarGoogle Scholar
  19. K. Lim, P. Ranganathan, J. Chang, C. Patel, T. Mudge, and S. Reinhardt. Understanding and designing new server architectures for emerging warehouse-computing environments. In ISCA '08: Proceedings of the 35th International Symposium on Computer Architecture, pages 315--326, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. D. Meisner, B. T. Gold, and T. F. Wenisch. Powernap: Eliminating server idle power. In ASPLOS '09: Proceeding of the 14th conference on Architectural support for programming languages and operating systems, pages 205--216, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Moore, J. Chase, P. Ranganathan, and R. Sharma. Making scheduling "cool": Temperature-aware workload placement in data centers. In Proceedings of USENIX, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. J. Moore, J. Chase, and P. Ranganathan. Weatherman: Automated, online and predictive thermal mapping and management for data centers. In ICAC '06: Proceedings of IEEE Conference on Autonomic Computing, pages 155--164, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. T. Mukherjee, Q. Tang, C. Ziesman, S. K. S. Gupta, and P. Cayton. Software architecture for dynamic thermal management in data centers. In COMSWARE, 2007.Google ScholarGoogle Scholar
  24. R. Nathuji and K. Schwan. Virtualpower: Coordinated power management in virtualized enterprise systems. In SOSP '07: Proceedings of 21st ACM SIGOPS symposium on Operating systems principles, pages 265--278, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. C. D. Patel, C. E. Bash, R. K. Sharma, and A. Beitelmal. Smart cooling of data centers. In Proceedings of the Pacific RIM/ASME International Electronics Packaging Technical Conference and Exhibition, IPACK, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  26. E. Pinheiro, R. Bianchini, E. V. Carrera, and T. Heath. Load balancing and unbalancing for power and performance in cluster-based systems. In Workshop on Compilers and Operating Systems for Low Power, 2001.Google ScholarGoogle Scholar
  27. R. Raghavendra, P. Ranganathan, V. Talwar, Z. Wang, and X. Zhu. No "power" struggles: Coordinated multi-level power management for the data center. In ASPLOS XIII: Proceedings of the 13th conference on Architectural support for programming languages and operating systems, pages 48--59, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. L. Ramos and R. Bianchini. C-Oracle: Predictive thermal management for data centers. In Proceedings of High-Performance Computer Architecture, HPCA, pages 111--122, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  29. P. Ranganathan, P. Leech, D. Irwin, and J. Chase. Ensemble-level power management for dense blade servers. ISCA '06: Proceedings of the 33rd annual international symposium on Computer Architecture, 34(2):66--77, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. C. Rusu, A. Ferreira, C. Scordino, and A. Watson. Energy-efficient real-time heterogeneous server clusters. In RTAS '06: Proceedings of the 12th IEEE Real-Time and Embedded Technology and Applications Symposium, pages 418--428, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. R. K. Sharma, C. Bash, C. D. Patel, R. Friedrich, and J. Chase. Balance of power: Dynamic thermal management for internet data centers. IEEE Internet Computing, 9(1):42--49, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. V. Sharma, A. Thomas, T. Abdelzaher, K. Skadron, and Z. Lu. Power-aware QoS management in web servers. In RTSS '03: Proceedings of the 24th International IEEE Real-Time Systems Symposium, pages 63--72, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. R. F. Sullivan. Alternating cold and hot aisles provides more reliable cooling for server farms. In Uptime Institute, 2000.Google ScholarGoogle Scholar
  34. B. Urgaonkar, P. Shenoy, and T. Roscoe. Resource overbooking and application profiling in shared hosting platforms. SIGOPS Oper. Syst. Rev., 36(SI):239--254, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. P. Welch. On a generalized M/G/1 queuing process in which the first customer of each busy period receives exceptional service. In Operations Research, vol. 12, pages 736--752, 1964.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Joint optimization of idle and cooling power in data centers while maintaining response time

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM SIGPLAN Notices
          ACM SIGPLAN Notices  Volume 45, Issue 3
          ASPLOS '10
          March 2010
          399 pages
          ISSN:0362-1340
          EISSN:1558-1160
          DOI:10.1145/1735971
          Issue’s Table of Contents
          • cover image ACM Conferences
            ASPLOS XV: Proceedings of the fifteenth International Conference on Architectural support for programming languages and operating systems
            March 2010
            422 pages
            ISBN:9781605588391
            DOI:10.1145/1736020
            • General Chair:
            • James C. Hoe,
            • Program Chair:
            • Vikram S. Adve

          Copyright © 2010 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 13 March 2010

          Check for updates

          Qualifiers

          • research-article

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader