Abstract
Intelligent agents are critical components of state-of-the-art game development. With advances in hardware, many games can simulate cities and ecosystems populated by agents; such settings are known as multi-agent environments. In this domain, reinforcement learning has been explored as a way to develop artificial agents for games. In reinforcement learning, an agent must discover which actions lead to greater rewards by experimenting with those actions, searching by trial and error. Specifying when to reward agents is not a simple task and requires knowledge of both the environment and the problem to be solved. Furthermore, defining the elements of multi-agent reinforcement learning required for the learning environment can be challenging for developers who are not domain experts. This paper proposes a framework for developing cooperative multi-agent game environments that eases this process and improves agent performance during reinforcement learning. The framework comprises steps for modeling the learning environment and for designing rewards and knowledge distribution, aiming at the best environment configuration for training. The framework was applied to the development of three multi-agent environments, and tests were conducted to analyze the reward-design techniques used. The results show that frequent rewards favor the emergence of essential behaviors (those necessary to solve the tasks), improving agent learning. Although knowledge distribution can reduce task complexity, dependency between groups is a decisive factor in its implementation.
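The contrast between sparse and frequent rewards mentioned above can be illustrated with a minimal sketch. This is not the paper's implementation; the function names, reward magnitudes, and the cooperative collection task are illustrative assumptions only.

```python
# Hypothetical sketch of two reward designs for a cooperative collection
# task: a sparse reward paid only on task completion, versus frequent
# rewards paid for each essential sub-behavior (here, collecting an item).

def sparse_reward(items_collected: int, items_total: int) -> float:
    """Reward the agent only when the whole task is finished."""
    return 1.0 if items_collected == items_total else 0.0

def frequent_reward(items_collected_this_step: int, task_done: bool) -> float:
    """Small reward per essential sub-behavior, plus a terminal bonus."""
    return 0.1 * items_collected_this_step + (1.0 if task_done else 0.0)

# A trajectory in which an agent collects 3 items, one per step:
sparse_total = sum(sparse_reward(i, 3) for i in (1, 2, 3))
frequent_total = sum(frequent_reward(1, step == 3) for step in (1, 2, 3))
print(sparse_total, frequent_total)
```

Under the sparse design the agent sees a non-zero signal only at the final step, whereas the frequent design rewards every item pickup, giving the learner an earlier gradient toward the essential behavior — consistent with the paper's finding that frequent rewards improve learning.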
This work is supported by CAPES and FAPERJ.
Acknowledgments
The authors would like to thank NVIDIA, CAPES and FAPERJ for the financial support.
Copyright information
© 2022 IFIP International Federation for Information Processing
Cite this paper
Ferreira, T., Clua, E., Kohwalter, T.C., Santos, R. (2022). OptimizingMARL: Developing Cooperative Game Environments Based on Multi-agent Reinforcement Learning. In: Göbl, B., van der Spek, E., Baalsrud Hauge, J., McCall, R. (eds) Entertainment Computing – ICEC 2022. ICEC 2022. Lecture Notes in Computer Science, vol 13477. Springer, Cham. https://doi.org/10.1007/978-3-031-20212-4_7
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20211-7
Online ISBN: 978-3-031-20212-4