Heuristic Reinforcement Learning Applied to RoboCup Simulation Agents

Celiberto, Luiz A.; Ribeiro, Carlos H. C.; Costa, Anna H. R.; Bianchi, Reinaldo A. C.

doi:10.1007/978-3-540-68847-1_19

Luiz A. Celiberto Jr.¹,²,
Carlos H. C. Ribeiro²,
Anna H. R. Costa³ &
…
Reinaldo A. C. Bianchi¹

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 5001)

Included in the following conference series:

Robot Soccer World Cup

Abstract

This paper describes the design and implementation of robotic agents for the RoboCup Simulation 2D category that learn using a recently proposed Heuristic Reinforcement Learning algorithm, Heuristically Accelerated Q-Learning (HAQL). This algorithm allows heuristics to be used to speed up the well-known Reinforcement Learning algorithm Q-Learning. What characterizes HAQL is a heuristic function that influences the choice of actions. A set of empirical evaluations was conducted in the RoboCup 2D Simulator, and the experimental results show that even very simple heuristics significantly enhance the performance of the agents.
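The idea of a heuristic that influences only the choice of actions can be illustrated with a minimal sketch. It assumes the commonly described form of HAQL, in which a heuristic value H(s, a), weighted by a factor xi, is added to Q(s, a) during greedy action selection while the Q-Learning update itself is left unchanged. The abstract does not give these details, so the function names, the parameter values, the hand-written heuristic, and the six-cell corridor task below are all hypothetical and are not the RoboCup setup used in the paper.

```python
import random
from collections import defaultdict

# Hypothetical toy task: a corridor of N cells; the goal is the rightmost cell.
N = 6
ACTIONS = ("left", "right")


def step(state, action):
    """Move one cell; reward 1.0 only when the goal cell is reached."""
    nxt = min(state + 1, N - 1) if action == "right" else max(state - 1, 0)
    done = nxt == N - 1
    return nxt, (1.0 if done else 0.0), done


def heuristic(state, action):
    """Assumed hand-written heuristic: a simple preference for moving right."""
    return 1.0 if action == "right" else 0.0


def choose_action(Q, H, state, actions, epsilon, xi, rng):
    """Epsilon-greedy choice whose greedy step maximizes Q(s, a) + xi * H(s, a)."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: Q[(state, a)] + xi * H(state, a))


def q_update(Q, state, action, reward, next_state, actions, alpha, gamma):
    """Standard one-step Q-Learning update; the heuristic plays no part here."""
    best_next = max(Q[(next_state, a)] for a in actions)
    Q[(state, action)] += alpha * (reward + gamma * best_next - Q[(state, action)])


def run(episodes=50, xi=1.0, epsilon=0.1, alpha=0.5, gamma=0.9, seed=0):
    """Train on the corridor and return the Q-table and steps taken per episode."""
    rng = random.Random(seed)
    Q = defaultdict(float)
    steps_per_episode = []
    for _ in range(episodes):
        state, done, steps = 0, False, 0
        while not done and steps < 1000:  # step cap guards against very long episodes
            action = choose_action(Q, heuristic, state, ACTIONS, epsilon, xi, rng)
            nxt, reward, done = step(state, action)
            q_update(Q, state, action, reward, nxt, ACTIONS, alpha, gamma)
            state, steps = nxt, steps + 1
        steps_per_episode.append(steps)
    return Q, steps_per_episode


if __name__ == "__main__":
    _, with_h = run(xi=1.0)
    _, without_h = run(xi=0.0)  # xi = 0 reduces the sketch to plain Q-Learning
    print("steps in first episode, with heuristic:", with_h[0])
    print("steps in first episode, without heuristic:", without_h[0])
```

Setting xi to 0 removes the heuristic and leaves ordinary epsilon-greedy Q-Learning, which makes it easy to compare the two variants on the same task. Because the heuristic enters only through action selection in this sketch, a poor heuristic can slow exploration but does not alter the update rule.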

Author information

Authors and Affiliations

Centro Universitário da FEI, Av. Humberto de Alencar Castelo Branco, 3972, 09850-901, São Bernardo do Campo, SP, Brazil
Luiz A. Celiberto Jr. & Reinaldo A. C. Bianchi
Instituto Tecnológico de Aeronáutica, Praça Mal. Eduardo Gomes, 50, 12228-900, São José dos Campos, SP, Brazil
Luiz A. Celiberto Jr. & Carlos H. C. Ribeiro
Laboratório de Técnicas Inteligentes, Escola Politécnica da Universidade de São Paulo, Av. Prof. Luciano Gualberto, trav. 3, 158, 05508-900, São Paulo, SP, Brazil
Anna H. R. Costa

Editor information

Ubbo Visser, Fernando Ribeiro, Takeshi Ohashi, Frank Dellaert

Cite this paper

Celiberto, L.A., Ribeiro, C.H.C., Costa, A.H.R., Bianchi, R.A.C. (2008). Heuristic Reinforcement Learning Applied to RoboCup Simulation Agents. In: Visser, U., Ribeiro, F., Ohashi, T., Dellaert, F. (eds) RoboCup 2007: Robot Soccer World Cup XI. RoboCup 2007. Lecture Notes in Computer Science, vol 5001. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68847-1_19

DOI: https://doi.org/10.1007/978-3-540-68847-1_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68846-4
Online ISBN: 978-3-540-68847-1
eBook Packages: Computer Science, Computer Science (R0)
