Skip to main content

Advertisement

Log in

Deep learning-based multivariate resource utilization prediction for hotspots and coldspots mitigation in green cloud data centers

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Dynamic virtual machine (VM) consolidation is a constructive technique to enhance resource usage and is extensively employed to minimize data centers’ energy consumption. However, in the current approaches, consolidation techniques are heavily relied on reducing the actively used physical servers (PMs) based on their current resource utilization without considering future resource demands. Also, many of the reported works for cloud workload prediction applied univariate time series-based forecasting models and neglected the dependency of other resource utilization metrics. Thus, resulting in inaccurate predictions, unnecessary migrations, high migration costs, and increased service level agreement violations (SLAVs) may nullify the consolidation benefits. To efficiently address this issue, we propose a multivariate resource usage prediction-based hotspots and coldspots mitigation approach that considers both the current and future usage of resources with O(sk) time complexity, where s and k denote the number of PMs and VMs, respectively. The proposed technique uses a clustering-based stacked bidirectional (Long Short-Term Memory) LSTM deep learning network to predict the future memory and CPU usage of PMs and VMs with high accuracy and \(O((Q(Q+W)*\Theta )\) computational complexity, where Q, W, and \(\Theta \) represent the number of hidden layer cells, outputs, and training epochs, respectively. Through extensive simulations based on Google’s cluster workload traces, we demonstrate that our proposed method obtains substantial improvements in terms of prediction performance, energy-efficiency, actively used PMs, VM migrations, and SLA violations over the benchmark approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23

Similar content being viewed by others

References

  1. Buyya R, Yeo CS, Venugopal S, Broberg J, Brandic I (2009) Cloud computing and emerging IT platforms: vision, hype, and reality for delivering computing as the 5th utility. Future Gener Comput Syst 25(6):599–616

    Article  Google Scholar 

  2. VMwareInc (2009) How VMware virtualization right sizes IT infrastructure to reduce power consumption. VMware Inc, Palo Alto, CA

  3. Paul B, Boris D, Keir F, Steven H, Tim H, Alex H, Rolf N, Ian P, Andrew W (2003) Xen and the art of virtualization. SIGOPS Oper Syst Rev 37(5):164–177

    Article  Google Scholar 

  4. Beloglazov A, Buyya R, Lee YC, Zomaya A (2011) A taxonomy and survey of energy-efficient data centers and cloud computing systems. Adv. Comput. 82:47–111

    Article  Google Scholar 

  5. Asad Z, Chaudhry MAR (2017) A two-way street: green big data processing for a greener smart grid. IEEE Syst. J. 11(2):784–795

    Article  Google Scholar 

  6. Gartner Inc (2007) Gartner estimates ICT industry accounts for 2 percent of global CO2 emissions. Gartner Press Release

  7. Global warming: Data centres to consume three times as much energy in next decade, experts warn (2016) https://www.independent.co.uk/environment/global-warming-data-centres-consume-three-times-much-energy-next-decade-experts-warn-a6830086.html. Accessed 24 Aug 2020

  8. How to stop data centres from gobbling up the world’s electricity (2018) https://www.nature.com/articles/d41586-018-06610-y. Accessed 27 Aug 2020

  9. Barroso LA, Hölzle U (2007) The case for energy-proportional computing. Computer 12:33–37

    Article  Google Scholar 

  10. Fan X, Weber WD, Barroso LA (2007) Power provisioning for a warehouse-sized computer. ACM SIGARCH Comput Archit News ACM 35:13–23

    Article  Google Scholar 

  11. Christopher C (2005) Live migration of virtual machines. In: Proceedings of the 2nd conference on symposium on networked systems design & implementation. Berkeley, CA, USA, pp 273–286

  12. Beloglazov A, Buyya R (2012) Optimal online deterministic algorithms and adaptive heuristics for energy and performance efficient dynamic consolidation of virtual machines in Cloud data centers. Concurr Comput Pract Exp 24(13):1397–1420

    Article  Google Scholar 

  13. Timothy W, Prashant S, Arun V, Mazin Y (2007) Black-box and gray-box strategies for virtual machine migration. In: Proc. 4th USENIX conference on networked systems design & implementation, NSDI’07, Cambridge, MA, USENIX Association, Berkeley, CA, USA, pp 229–242

  14. Bin-packing (2006) In: Proc. combinatorial optimization, ser. Algorithms and combinatorics, vol 21. Springer Berlin Heidelberg, pp 426-441

  15. Khan MA, Paplinski A, Khan AM, Murshed M, Buyya R (2018) Dynamic virtual machine consolidation algorithms for energy-efficient cloud resource management: a review. Sustainable cloud and energy services. Springer, New York, pp 135–165

    Chapter  Google Scholar 

  16. Zhou Q, Xu M, Gill SS, Gao C, Tian W, Xu C, Buyya R (2020) Energy efficient algorithms based on VM consolidation for cloud computing: comparisons and evaluations. In: proc. 20th IEEE/ACM international symposium on cluster, cloud and internet computing (CCGRID). Melbourne, Australia, pp 489–498

  17. Calheiros RN, Ranjan R, Beloglazov A, De Rose CAF, Buyya R (2011) CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Softw. Pract. Exp. 41(1):23–50

    Article  Google Scholar 

  18. Farahnakian F, Liljeberg P, Plosila J (2013) LiRCUP: linear regression based CPU usage prediction algorithm for live migration of virtual machines in data centers. In: Proc. 39th euromicro conference on software engineering and advanced applications. Santander, pp 357–364

  19. Farahnakian F, Pahikkala T, Liljeberg P, Plosila J (2013) Energy aware consolidation algorithm based on k-nearest neighbor regression for cloud data centers. In: Proc. IEEE/ACM 6th international conference on utility and cloud computing. Dresden, pp 256–259

  20. Farahnakian F, Ashraf A, Pahikkala T, Liljeberg P, Plosila J, Porres I, Tenhunen H (2015) Using ant colony system to consolidate VMs for green cloud computing. IEEE Trans Ser Comput 8(2):187–198

    Article  Google Scholar 

  21. Farahnakian F, Pahikkala T, Liljeberg P, Plosila J, Hieu NT, Tenhunen H (2019) Energy-aware VM consolidation in cloud data centers using utilization prediction model. IEEE Trans Cloud Comput 7(2):524–536

    Article  Google Scholar 

  22. Moghaddam SM, O’Sullivan M, Walker C, Piraghaj SF, Unsworth CP (2020) Embedding individualized machine learning prediction models for energy efficient VM consolidation within Cloud data centers. Future Gener Comput Syst 106:221–233

  23. Haghshenas K, Mohammadi S (2020) Prediction-based underutilized and destination host selection approaches for energy-efficient dynamic VM consolidation in data centers. J Supercomput 76:10240–10257

    Article  Google Scholar 

  24. Hsieh S-Y, Liu C-S, Buyya R, Zomaya AY (2020) Utilization-prediction-aware virtual machine consolidation approach for energy-efficient cloud data centers. J Parallel Distrib Comput 139:99–109

    Article  Google Scholar 

  25. Tarafdar A, Debnath M, Khatua S et al (2020) Energy and quality of service-aware virtual machine consolidation in a cloud data center. J Supercomput. 76:9095–9126

    Article  Google Scholar 

  26. Lu S, Chen J (2018) Host overloading detection based on EWMA algorithm in cloud computing environment. In: Proc. IEEE 15th international conference on e-business engineering (ICEBE). Xi’An, China, pp 274–279

  27. Donnell NM, Howley E, Duggan J (2020) Dynamic virtual machine consolidation using a multi-agent system to optimise energy efficiency in cloud computing. Future Gener Comput Syst 108:288–301

    Article  Google Scholar 

  28. Monshizadeh Naeen H, Zeinali E, Toroghi Haghighat A (2020) Adaptive Markov-based approach for dynamic virtual machine consolidation in cloud data centers with quality-of-service constraints. Softw Pract Exp 50:161–183

    Article  Google Scholar 

  29. Li Z, Xinrong Y, Lei Y, Guo S, Chang V (2020) Energy-efficient and quality-aware VM consolidation method. Future Gener Comput Syst 102:789–809

    Article  Google Scholar 

  30. El-Moursy A, Abdelsamea A, Kamran R, Saad M (2019) Multi-dimensional regression host utilization algorithm (MDRHU) for host overload detection in cloud computing. J Cloud Comput 8:1–17

    Article  Google Scholar 

  31. Ranjbari M, Torkestani JA (2018) A learning automata-based algorithm for energy and SLA efficient consolidation of virtual machines in cloud data centers. J Parallel Distrib Comput 113:55–62

    Article  Google Scholar 

  32. Subramanian S, Kannammal A (2019) Real time non-linear cloud workload forecasting using the holt-winter model. In: Proc. 10th international conference on computing, communication and networking technologies (ICCCNT). Kanpur, India, pp 1–6

  33. Qi Z, Mohamed FZ, Shuo Z, Quanyan Z, Raouf B, Joseph LH (2012) Dynamic energy-aware capacity provisioning for cloud computing environments. In: Proc. international conference on autonomic computing (ICAC). ACM, New York, NY, USA, pp 145–154

  34. Podolskiy V, Jindal A, Gerndt M, Oleynik Y (2018) Forecasting models for self-adaptive cloud applications: a comparative study. In: Proc. IEEE 12th international conference on self-adaptive and self-organizing systems (SASO). Trento, Italy, pp 40–49

  35. Caglar F, Gokhale A (2014) iOverbook: intelligent resource overbooking to support soft real-time applications in the cloud. In: Proc. IEEE 7th international conference on cloud computing (CLOUD). IEEE, Anchorage, AK, USA, pp 538–545

  36. Gong Z, Gu X, Wilkes J (2010) PRESS: PRedictive elastic resource scaling for cloud systems. In: Proc. international conference on network and service management (CNSM). IEEE, Niagara Falls, ON, Canada, pp 9–16

  37. Nguyen H, Shen Z, Gu X, Subbiah S, Wilkes J (2013) AGILE: elastic distributed resource scaling for infrastructure-as-a-service. In: Proc. 10th international conference on autonomic computing (ICAC). San Jose, CA, USENIX, pp 69–82

  38. Ghorbani M, Wang Y, Xue Y, Pedram M, Bogdan P (2014) Prediction and control of bursty cloud workloads: a fractal framework. In: Proc. international conference on hardware/software codesign and system synthesis (CODES+ISSS), pp 1–9

  39. Reiss C, Wilkes J, Hellerstein JL (2011) Google cluster-usage traces: format + schema. https://github.com/google/cluster-data

  40. Song B, Yu Y, Zhou Y et al (2018) Host load prediction with long short-term memory in cloud computing. J Supercomput. 74:6554–6568

    Article  Google Scholar 

  41. Chakraborty K, Mehrotra K, Mohan CK, Ranka S (1992) Forecasting the behavior of multivariate time series using neural networks. Neural Netw. 5(6):961–970

    Article  Google Scholar 

  42. Aboagye-Sarfo P et al (2015) A comparison of multivariate and univariate time series approaches to modelling and forecasting emergency department demand in Western Australia. J Biomed Inf 57:62–73

    Article  Google Scholar 

  43. Dannecker L (2015) Energy time series forecasting: efficient and accurate forecasting of evolving time series from the energy domain, 1st edn. Springer, Berlin

    Book  Google Scholar 

  44. Sun Y, Li J, Liu J, Chow C, Sun B, Wang R (2015) Using causal discovery for feature selection in multivariate numerical time series. Mach Learn 101(1–3):377–395

    Article  MathSciNet  MATH  Google Scholar 

  45. Zhao Y, Ye L, Li Z, Song X, Lang Y, Su J (2016) A novel bidirectional mechanism based on time series model for wind power forecasting. Appl Energy 177:793–803

    Article  Google Scholar 

  46. Wakuya H, Shida K (2001) Bi-directionalization of neural computing architecture for time series prediction. III. Application to laser intensity time record Data Set A. In: Proc. international joint conference on neural networks (IJCNN). IEEE, Washington, DC, USA, pp 2098–2103

  47. Cui Z, Ke R, Pu Z, Wang Y (2020) Stacked bidirectional and unidirectional LSTM recurrent neural network for forecasting network-wide traffic state with missing values. Transp Res Part C Emerg Technol 118:102674

    Article  Google Scholar 

  48. Gupta S, Dinesh DA (2017) Resource usage prediction of cloud workloads using deep bidirectional long short term memory networks. In: Proc. international conference on advanced networks and telecommunications systems (ANTS). IEEE, Bhubaneswar, pp 1–6

  49. Gupta S, Dileep AD, Gonsalves TA (2018) A joint feature selection framework for multivariate resource usage prediction in cloud servers using stability and prediction performance. J Supercomput 74(11):6033–6068

    Article  Google Scholar 

  50. Beloglazov A, Abawajy J, Buyya R (2012) Energy-aware resource allocation heuristics for efficient management of data centers for cloud computing. Future Gener Comput Syst 28(5):755–768

    Article  Google Scholar 

  51. Standard Performance Evaluation Corporation (SPEC) (2008), http://www.spec.org

  52. Blackburn M, Grid G (2008) Five ways to reduce data center server power consumption (white paper). The Green Grid

  53. Boutaba R, Zhang Q, Zhani MF (2014) Virtual machine migration in cloud computing environments: benefits, challenges, and approaches. Communication infrastructures for cloud computing. IGI Global, Pennsylvania, pp 383–408

    Chapter  Google Scholar 

  54. Zakarya M, Gillam L (2019) Managing energy, performance and cost in large scale heterogeneous datacenters using migrations. Future Gener Comput Syst. 93:529–547

    Article  Google Scholar 

  55. Schuster M, Paliwal KK (1997) Bidirectional recurrent neural networks. IEEE Trans Signal Process 45(11):2673–2681

    Article  Google Scholar 

  56. Wang M, Meng X, Zhang L (2011) Consolidating virtual machines with dynamic bandwidth demand in data centers. In: 2011 Proceedings IEEE INFOCOM, vol 201. pp 71–75

  57. Behera S, Misra R, Sillitti A (2021) Multiscale deep bidirectional gated recurrent neural networks based prognostic method for complex non-linear degradation systems. Inf Sci 554:120–144

    Article  MathSciNet  Google Scholar 

  58. Murtazaev Aziz, Sangyoon Oh (2011) Sercon: server consolidation algorithm using live migration of virtual machines for green computing. IETE Tech Rev 28(3):212–231

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yashwant Singh Patel.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Patel, Y.S., Jaiswal, R. & Misra, R. Deep learning-based multivariate resource utilization prediction for hotspots and coldspots mitigation in green cloud data centers. J Supercomput 78, 5806–5855 (2022). https://doi.org/10.1007/s11227-021-04107-6

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-021-04107-6

Keywords

Navigation