Generalization of Reinforcement Learning Agents for Production Control

Overbeck, Leonard; Glaser, Valentin; May, Marvin Carl; Lanza, Gisela

doi:10.1007/978-3-031-34821-1_37

Leonard Overbeck¹²,
Valentin Glaser¹²,
Marvin Carl May¹² &
…
Gisela Lanza¹²

Part of the book series: Lecture Notes in Mechanical Engineering ((LNME))

Included in the following conference series:

Proceedings of the Changeable, Agile, Reconfigurable and Virtual Production Conference and the World Mass Customization & Personalization Conference

732 Accesses

Abstract

In times of rapidly changing markets and increasing complexity the fast and precise adaption of production systems to new circumstances is key for the economic success of manufacturing companies. Given highly adaptable production systems, the control of these systems still has to be optimized for each configuration. One critical aspect is worker control. To enable an automatic control logic which is capable of adapting autonomously and anticipatory to new configurations of the production system, a combination of reinforcement learning (RL) with simulation is promising. Key to a successful implementation of RL in such a dynamic production system is that the RL agent is able to perform well independently of the present system configuration and not just in the one system he is originally trained in. This paper presents an approach to develop such generalized production control RL agents, which was tested and evaluated using a discrete event simulation of a real-world production system. The approach defines a methodology of hyperparameter tuning for generalization including training, agent selection and testing of the RL in independent configurations of the production system. The results indicate that the approach is very successful in creating a generalized production control RL agent, which is able to control the workers efficiently in various configurations of a production system and adapt rapidly to new circumstances.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Westkämper, E., Löffler, C.: Strategien der Produktion. Springer, Berlin (2016)
Google Scholar
Pinedo, M.L.: Scheduling. Springer, US, Boston, MA (2012)
Book MATH Google Scholar
Overbeck, L., Hugues, A., May, M.C., et al.: Reinforcement learning based production control of semi-automated manufacturing systems. Procedia CIRP 103, 170–175 (2021). https://doi.org/10.1016/j.procir.2021.10.027
Article Google Scholar
Belousov, B., Abdulsamad, H., Klink, P., et al.: Reinforcement Learning Algorithms: Analysis and Applications, vol. 883. Springer International Publishing, Cham (2021)
Google Scholar
Kuhnle, A., Röhrig, N., Lanza, G.: Autonomous order dispatching in the semiconductor industry using reinforcement learning. Procedia CIRP 79, 391–396 (2019). https://doi.org/10.1016/j.procir.2019.02.101
Article Google Scholar
Kuhnle, A., May, M.C., Schäfer, L., et al.: Explainable reinforcement learning in production control of job shop manufacturing system. Int. J. Prod. Res. 60, 5812–5834 (2022). https://doi.org/10.1080/00207543.2021.1972179
Article Google Scholar
Cobbe, K., Klimov, O., Hesse, C., et al.: Quantifying Generalization in Reinforcement Learning (2018). https://arxiv.org/pdf/1812.02341
Wang, K., Kang, B., Shao, J., et al.: Improving Generalization in Reinforcement Learning with Mixture Regularization (2020)
Google Scholar
Ripley, B.D.: Pattern Recognition and Neural Networks, 1. Paperback ed. 1997, reprinted 2009. Cambridge University Press, Cambridge (2009)
Google Scholar
Pfeifer, T., Schmitt, R. (eds.): Masing Handbuch Qualitätsmanagement, 7. überarbeitete Auflage. Hanser eLibrary. Hanser, München (2021)
Google Scholar
Schiefer, H., Schiefer, F.: Statistik für Ingenieure. Springer Fachmedien Wiesbaden, Wiesbaden (2018)
Google Scholar
Zhou, Z.-H.: Machine Learning. Springer Singapore, Singapore (2021)
Google Scholar
Kubat, M.: An Introduction to Machine Learning. Springer International Publishing, Cham (2021)
Google Scholar
Kirk, R., Zhang, A., Grefenstette, E., et al.: A Survey of Generalisation in Deep Reinforcement Learning (2021)
Google Scholar
Ahmed, Z., Le Roux, N., Norouzi, M., et al.: Understanding the impact of entropy on policy optimization (2018)
Google Scholar
Dong, H., Ding, Z., Zhang, S.: Deep Reinforcement Learning. Springer Singapore, Singapore (2020)
Google Scholar
Igl, M., Ciosek, K., Li, Y., et al.: Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck (2019)
Google Scholar
Siebertz, K., van Bebber, D., Hochkirchen, T.: Statistische Versuchsplanung. Springer, Berlin (2017)
Google Scholar
Plappert, M., Houthooft, R., Dhariwal, P., et al.: Parameter Space Noise for Exploration (2017)
Google Scholar
Tensorforce Team: Proximal Policy Optimization (2022). https://tensorforce.readthedocs.io/en/latest/agents/ppo.html. Accessed 11 Jan 2023

Download references

Author information

Authors and Affiliations

wbk Institute of Production Science, Karlsruhe Institute of Technology, Kaiserstr. 12, 76131, Karlsruhe, Germany
Leonard Overbeck, Valentin Glaser, Marvin Carl May & Gisela Lanza

Authors

Leonard Overbeck
View author publications
You can also search for this author in PubMed Google Scholar
Valentin Glaser
View author publications
You can also search for this author in PubMed Google Scholar
Marvin Carl May
View author publications
You can also search for this author in PubMed Google Scholar
Gisela Lanza
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Leonard Overbeck .

Editor information

Editors and Affiliations

Department of Industrial Engineering (DIN), Alma Mater Studiorum—University of Bologna, Bologna, Italy
Francesco Gabriele Galizia
Department of Industrial Engineering (DIN), Alma Mater Studiorum—University of Bologna, Bologna, Italy
Marco Bortolini

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Overbeck, L., Glaser, V., May, M.C., Lanza, G. (2023). Generalization of Reinforcement Learning Agents for Production Control. In: Galizia, F.G., Bortolini, M. (eds) Production Processes and Product Evolution in the Age of Disruption. CARV 2023. Lecture Notes in Mechanical Engineering. Springer, Cham. https://doi.org/10.1007/978-3-031-34821-1_37

Download citation

DOI: https://doi.org/10.1007/978-3-031-34821-1_37
Published: 15 July 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-34820-4
Online ISBN: 978-3-031-34821-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics