Calibrating a Motion Model Based on Reinforcement Learning for Pedestrian Simulation

Martinez-Gil, Francisco; Lozano, Miguel; Fernández, Fernando

doi:10.1007/978-3-642-34710-8_28

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7660)

Abstract

This paper presents the calibration of a framework based on multi-agent reinforcement learning (RL) for generating motion simulations of pedestrian groups. The framework sets up a group of autonomous embodied agents, each of which learns to control its own instantaneous velocity vector in scenarios with collisions and friction forces. The result of the process is a different learned motion controller for each agent. Calibrating both the physical properties involved in the motion of the embodied agents and the corresponding dynamics is important for a realistic simulation. The physics engine used has been calibrated with values taken from real pedestrian dynamics. Two experiments have been carried out to test this approach, and their results are compared with databases of real pedestrians in similar scenarios. The comparison tool is the diagram of speed versus density, known in the literature as the fundamental diagram.
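
To make the comparison tool concrete, the following is a minimal sketch of how a speed-versus-density (fundamental) diagram could be tabulated from simulation output. It is not the authors' implementation: the function names (`speed`, `density`, `fundamental_diagram`), the fixed-width density binning, and the `bin_width` default are all assumptions introduced here for illustration.

```python
import math
from collections import defaultdict


def speed(vx, vy):
    """Magnitude of a 2D velocity vector (hypothetical helper), in m/s."""
    return math.hypot(vx, vy)


def density(num_agents, area_m2):
    """Pedestrians per square metre inside a measurement area (hypothetical helper)."""
    if area_m2 <= 0:
        raise ValueError("area_m2 must be positive")
    return num_agents / area_m2


def fundamental_diagram(samples, bin_width=0.5):
    """Average the observed speeds within fixed-width density bins.

    samples: iterable of (density, speed) pairs, one per observation.
    Returns a list of (bin_centre, mean_speed, count), sorted by density.
    The binning scheme is an assumption, not taken from the paper.
    """
    if bin_width <= 0:
        raise ValueError("bin_width must be positive")
    sums = defaultdict(float)
    counts = defaultdict(int)
    for rho, v in samples:
        b = int(rho // bin_width)  # index of the density bin
        sums[b] += v
        counts[b] += 1
    return [
        ((b + 0.5) * bin_width, sums[b] / counts[b], counts[b])
        for b in sorted(sums)
    ]
```

Each observation pairs the density measured in some area with the speed of an agent at that moment. The same tabulation can be applied to simulated and to empirical observations, and the two resulting curves compared bin by bin.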

Author information

Authors and Affiliations

Departament d’Informàtica, Universitat de València, Av. de la Universidad s/n, 46100, Burjassot, Valencia, Spain
Francisco Martinez-Gil & Miguel Lozano
Department of Computer Science, Universidad Carlos III, Av. de la Universidad 30, 28911, Leganés, Madrid, Spain
Fernando Fernández

Editor information

Editors and Affiliations

School of Engineering, University of California, Merced, 5200 N. Lake Road, 95343, Merced, CA, USA
Marcelo Kallmann
Department of Computer Science, Rutgers University, 110 Frelinghuysen Road, 08854, Piscataway, NJ, USA
Kostas Bekris

About this paper

Cite this paper

Martinez-Gil, F., Lozano, M., Fernández, F. (2012). Calibrating a Motion Model Based on Reinforcement Learning for Pedestrian Simulation. In: Kallmann, M., Bekris, K. (eds) Motion in Games. MIG 2012. Lecture Notes in Computer Science, vol 7660. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34710-8_28

DOI: https://doi.org/10.1007/978-3-642-34710-8_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34709-2
Online ISBN: 978-3-642-34710-8
eBook Packages: Computer Science, Computer Science (R0)
