Improvement of Systems Management Policies Using Hybrid Reinforcement Learning

Tesauro, Gerald; Jong, Nicholas K.; Das, Rajarshi; Bennani, Mohamed N.

doi:10.1007/11871842_80

Gerald Tesauro²¹,
Nicholas K. Jong²²,
Rajarshi Das²¹ &
…
Mohamed N. Bennani²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4212))

Included in the following conference series:

European Conference on Machine Learning

5448 Accesses
1 Citations

Abstract

Reinforcement Learning (RL) holds particular promise in an emerging application domain of performance management of computing systems. In recent work, online RL yielded effective server allocation policies in a prototype Data Center, without explicit system models or built-in domain knowledge. This paper presents a substantially improved and more practical “hybrid” approach, in which RL trains offline on data collected while a queuing-theoretic policy controls the system. This approach avoids potentially poor performance in live online training. Additionally we use nonlinear function approximators instead of tabular value functions; this greatly improves scalability, and surprisingly, eliminated the need for exploratory actions. In experiments using both open-loop and closed-loop traffic as well as large switching delays, our results show significant performance improvement over state-of-art queuing model policies.

Download to read the full chapter text

Chapter PDF

Cloud Resource Allocation from the User Perspective: A Bare-Bones Reinforcement Learning Approach

Random task scheduling scheme based on reinforcement learning in cloud computing

Article 11 September 2015

An adaptive RL based approach for dynamic resource provisioning in Cloud virtualized data centers

Article 03 May 2015

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Das, R., Tesauro, G., Walsh, W.E.: Model-based and model-free approaches to autonomic resource allcation. Technical Report RC23802, IBM Research (2005)
Google Scholar
Tesauro, G.: Online resource allocation using decompositional reinforcement learning. In: Proc. of AAAI 2005. AAAI Press, Menlo Park (2005)
Google Scholar
Vengerov, D., Iakovlev, N.: A reinforcement learning framework for dynamic resource allocation: First results. In: Proc. of ICAC 2005 (2005)
Google Scholar
Price, B., Boutilier, C.: Accelerating reinforcement learning through implicit imitation. J. of AI Research 19, 569–629 (2003)
MATH Google Scholar
Lavenberg, S.S.: Personal communication (2006)
Google Scholar
Tesauro, G., Jong, N.K., Das, R., Bennani, M.N.: A hybrid reinforcement learning approach to autnomic resource allocation. In: Proc. of ICAC 2006, pp. 65–73 (2006)
Google Scholar
Squillante, M.S., Yao, D.D., Zhang, L.: Internet traffic: Periodicity, tail behavior and performance implications. In: Gelenbe, E. (ed.) System Performance Evaluation: Methodologies and Applications. CRC Press, Boca Raton (1999)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
Google Scholar
Baird, L.: Residual algorithms: Reinforcement learning with function approximation. In: Proc. of ICML 1995 (1995)
Google Scholar
Abbeel, P., Ng, A.Y.: Exploration and apprenticeship learning in reinforcement learning. In: Proc. of ICML 2005 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

IBM TJ Watson Research Center, 19 Skyline Drive, Hawthorne, NY, 10532, USA
Gerald Tesauro & Rajarshi Das
Dept. of Computer Sciences, Univ. of Texas, Austin, TX, 78712, USA
Nicholas K. Jong
Dept. of Computer Science, George Mason Univ., Fairfax, VA, 22030, USA
Mohamed N. Bennani

Authors

Gerald Tesauro
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas K. Jong
View author publications
You can also search for this author in PubMed Google Scholar
Rajarshi Das
View author publications
You can also search for this author in PubMed Google Scholar
Mohamed N. Bennani
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Knowledge Engineering Group, Technische Universität Darmstadt,
Johannes Fürnkranz
Max Planck Institute for Computer Science, Saarbrücken, Germany
Tobias Scheffer
Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Germany
Myra Spiliopoulou

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tesauro, G., Jong, N.K., Das, R., Bennani, M.N. (2006). Improvement of Systems Management Policies Using Hybrid Reinforcement Learning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds) Machine Learning: ECML 2006. ECML 2006. Lecture Notes in Computer Science(), vol 4212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11871842_80

Download citation

DOI: https://doi.org/10.1007/11871842_80
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45375-8
Online ISBN: 978-3-540-46056-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Improvement of Systems Management Policies Using Hybrid Reinforcement Learning

Abstract

Chapter PDF

Similar content being viewed by others

Cloud Resource Allocation from the User Perspective: A Bare-Bones Reinforcement Learning Approach

Random task scheduling scheme based on reinforcement learning in cloud computing

An adaptive RL based approach for dynamic resource provisioning in Cloud virtualized data centers

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Improvement of Systems Management Policies Using Hybrid Reinforcement Learning

Abstract

Chapter PDF

Similar content being viewed by others

Cloud Resource Allocation from the User Perspective: A Bare-Bones Reinforcement Learning Approach

Random task scheduling scheme based on reinforcement learning in cloud computing

An adaptive RL based approach for dynamic resource provisioning in Cloud virtualized data centers

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation