Abstract
Today we are noticing a significant increase in energy costs used by High-Performance Computing. However, increasing demand for information processing have led to cheaper, faster and larger data management systems. This demand requires employing more hardware and software to meet the service needs which in turn put further pressure on energy costs. In data-centric applications, DBMSs are one of the major energy consumers. So faced to this situation, integrating energy in the database design becomes an economic necessity. To satisfy this key requirement, the development of cost models estimating the energy consumption is one of the relevant issues. While a number of recent papers have explored this problem, the majority of the existing work considers prediction energy for a single standalone query. In this paper, we consider a more general problem of multiple concurrently running queries. This is useful for many database management’s tasks, including admission control, query scheduling and execution control with energy efficiency as a first-class performance goal. We propose a methodology to define an energy-consumption cost model to estimate the cost of executing concurrent workload via statistical regression techniques. We first use the optimizer’s cost model to estimate the I/O and CPU requirements for each query pipeline in the workload, then we fit statistical models to the observed energy at these query pipelines, finally we use the combination of these models to predict concurrent workload energy consumption. To evaluate the quality of our cost model, we conduct experiments using a real DBMS with a dataset of TPC-H and TPC-DS benchmarks. The obtained results show the quality of our cost model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
The datasets and the system setup we used in our experiments can be found in Sect. 3.
- 3.
References
Agrawal, R., Ailamaki, A., Bernstein, P.A., Brewer, E.A., Carey, M.J., Chaudhuri, S., et al.: The claremont report on database research. ACM SIGMOD Rec. 37(3), 9–19 (2008)
Ahmad, M., Duan, S., et al.: Predicting completion times of batch query workloads using interaction-aware models and simulation. In: EDBT, pp. 449–460. ACM (2011)
Alonso, R., Ganguly, S.: Energy efficient query optimization. In: Matsushita Info Tech Lab, Citeseer (1992)
Beloglazov, A., Buyya, R., Lee, Y.C., et al.: A taxonomy and survey of energy-efficient data centers and cloud computing systems. Adv. Comput. 82(2), 47–111 (2011)
Chaudhuri, S., Narasayya, V., Ramamurthy, R.: Estimating progress of execution for sql queries. In: SIGMOD, pp. 803–814. ACM (2004)
Garcia-Molina, H., Ullman, J.D., Widom, J.: Database System Implementation, vol. 654. Prentice Hall, Upper Saddle River (2000)
Harizopoulos, S., Shah, M., Meza, J., Ranganathan, P.: Energy efficiency: the new holy grail of data management systems research. arXiv preprint arXiv:0909.1784 (2009)
Intel, Oracle: Oracle exadata on intel\(^{\textregistered }\) xeon\(^{\textregistered }\) processors: Extreme performance for enterprise computing. White paper (2011)
Koomey, J.: Growth in data center electricity use 2005 to 2010. A report by Analytical Press, completed at the request of The New York Times (2011)
Kunjir, M., Birwa, P.K., et al.: Peak power plays in database engines. In: Proceedings of the 15th International Conference on Extending Database Technology, pp. 444–455. ACM (2012)
Lang, W., Kandhan, R., Patel, J.M.: Rethinking query processing for energy efficiency: slowing down to win the race. IEEE Data Eng. Bull. 34(1), 12–23 (2011)
Lang, W., Patel, J.: Towards eco-friendly database management systems. arXiv preprint arXiv:0909.1767 (2009)
Li, J., Nehme, R., Naughton, J.: Gslpi: a cost-based query progress indicator. In: 2012 IEEE 28th International Conference on Data Engineering (ICDE), pp. 678–689. IEEE (2012)
Luo, G., Naughton, J.F., Ellmann, C.J., Watzke, M.W.: Toward a progress indicator for database queries. In: SIGMOD, pp. 791–802. ACM (2004)
McCullough, J.C., Agarwal, Y., Chandrashekar, J., et al.: Evaluating the effectiveness of model-based power characterization. In: USENIX Annual Technical Conference (2011)
Poess, M., Nambiar, R.O.: Energy cost, the key challenge of today’s data centers: a power consumption analysis of tpc-c results. PVLDB 1(2), 1229–1240 (2008)
Poess, M., Nambiar, R.O., Walrath, D.: Why you should run tpc-ds: a workload analysis. In: VLDB, pp. 1138–1149. VLDB Endowment (2007)
Rodriguez-Martinez, M., Valdivia, H., Seguel, J., Greer, M.: Estimating power/energy consumption in database servers. Procedia Comput. Sci. 6, 112–117 (2011)
Roukh, A., Bellatreche, L.: Eco-processing of olap complex queries. In: Data Warehousing and Knowledge Discovery. Springer (2015, to appear)
Tu, Y.C., Wang, X., Zeng, B., Xu, Z.: A system for energy-efficient data management. ACM SIGMOD Rec. 43(1), 21–26 (2014)
Wang, J., Feng, L., Xue, W., Song, Z.: A survey on energy-efficient data management. ACM SIGMOD Rec. 40(2), 17–23 (2011)
Xu, Z., Tu, Y.C., Wang, X.: Exploring power-performance tradeoffs in database systems. In: ICDE, pp. 485–496 (2010)
Xu, Z., Tu, Y.C., Wang, X.: Dynamic energy estimation of query plans in database systems. In: ICDCS, pp. 83–92. IEEE (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Roukh, A. (2015). Estimating Power Consumption of Batch Query Workloads. In: Bellatreche, L., Manolopoulos, Y. (eds) Model and Data Engineering. Lecture Notes in Computer Science(), vol 9344. Springer, Cham. https://doi.org/10.1007/978-3-319-23781-7_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-23781-7_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23780-0
Online ISBN: 978-3-319-23781-7
eBook Packages: Computer ScienceComputer Science (R0)