Versatile modular neural locomotion control with fast learning

Thor, Mathias; Manoonpong, Poramate

doi:10.1038/s42256-022-00444-0

Article
Published: 14 February 2022

Versatile modular neural locomotion control with fast learning

Nature Machine Intelligence volume 4, pages 169–179 (2022)Cite this article

2310 Accesses
10 Citations
37 Altmetric
Metrics details

Subjects

A preprint version of the article is available at arXiv.

Abstract

Legged robots have significant potential to operate in unstructured environments. The design of locomotion control is, however, still challenging. Currently, controllers must be manually designed for specific robots and tasks, or automatically designed via machine learning methods that require long training times and yield large opaque controllers. Drawing inspiration from animal locomotion, we propose a simple yet versatile modular neural control structure with fast learning. The key advantages of our approach are that behaviour-specific control modules can be added incrementally to obtain increasingly complex emergent locomotion behaviours, and that neural connections can be quickly and automatically learned. In a series of experiments, we show how eight modules can be quickly learned and added to a base control module to obtain emergent adaptive behaviours allowing a hexapod robot to navigate in complex environments. We also show that modules can be added and removed during operation without affecting the functionality of the remaining controller. Finally, the controller is successfully demonstrated on a physical robot. Taken together, our study reveals a significant step towards fast automatic design of versatile neural locomotion control.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: Overview of the versatile modular neural locomotion controller.**

**Fig. 2: Results of learning the base and obstacle reflex controllers.**

**Fig. 3: Results of learning the body posture and directional locomotion controllers.**

**Fig. 4: Results for using the learned primitive control modules.**

**Fig. 5: The advanced behaviours and the results for using all eight control modules.**

**Fig. 6: Detailed overview of the versatile modular neural locomotion controller.**

Machine learning reveals the control mechanics of an insect wing hinge

Article 17 April 2024

Johan M. Melis, Igor Siwanowicz & Michael H. Dickinson

Complex activity and short-term plasticity of human cerebral organoids reciprocally connected with axons

Article Open access 10 April 2024

Tatsuya Osaki, Tomoya Duenki, … Yoshiho Ikeuchi

Neural operators for accelerating scientific simulations and design

Article 08 April 2024

Kamyar Azizzadenesheli, Nikola Kovachki, … Anima Anandkumar

Data availability

All data from running the experiments as well as the learned weight sets can be accessed at https://github.com/MathiasThor/CPG-RBFN-framework/tree/main/data⁵⁰.

Code availability

The source code for running the controller in simulation can be accessed at https://github.com/MathiasThor/CPG-RBFN-framework⁵⁰.

References

Lee, J., Hwangbo, J., Wellhausen, L., Koltun, V. & Hutter, M. Learning quadrupedal locomotion over challenging terrain. Sci. Robot. 5, eabc5986 (2020).
Article Google Scholar
Winkler, A. W. Optimization-Based Motion Planning for Legged Robots. PhD thesis, ETH Zurich (2018).
Machado, J. A. T. & Silva, M. F. An overview of legged robots. In Proc. MME International Symposium on Mathematical Methods in Engineering (2006).
Thor, M., Kulvicius, T. & Manoonpong, P. Generic neural locomotion control framework for legged robots. In IEEE Transactions on Neural Networks and Learning Systems Vol. 32, 4013–4025 (IEEE, 2021).
Cully, A., Clune, J., Tarapore, D. & Mouret, J.-B. Robots that can adapt like animals. Nature 521, 503–507 (2015).
Article Google Scholar
Silva, M. F. & Machado, J. A. T. A literature review on the optimization of legged robots. J. Vib. Control 18, 1753–1767 (2012).
Article MathSciNet Google Scholar
Hwangbo, J. et al. Learning agile and dynamic motor skills for legged robots. Sci. Robot. 4, eaau5872 (2019).
Article Google Scholar
Clune, J., Stanley, K. O., Pennock, R. T. & Ofria, C. On the performance of indirect encoding across the continuum of regularity. IEEE Trans. Evol. Comput. 15, 346–367 (2011).
Article Google Scholar
Schilling, M., Konen, K., Ohl, F. W. & Korthals, T. Decentralized deep reinforcement learning for a distributed and adaptive locomotion controller of a hexapod robot. In Proc. IEEE Int. Conf. Intell. Robots Syst. 5335–5342 (IEEE, 2020).
Yang, C., Yuan, K., Zhu, Q., Yu, W. & Li, Z. Multi-expert learning of adaptive legged locomotion. Sci. Robot. 5, eabb2174 (2020).
Article Google Scholar
Schilling, M., Konen, K. & Korthals, T. Modular deep reinforcement learning for emergent locomotion on a six-legged robot. In Proc. 8th IEEE RAS/EMBS International Conference for Biomedical Robotics and Biomechatronics (BioRob) 946–953 (IEEE, 2020).
Oliveira, M., Costa, L., Rocha, A., Santos, C. & Ferreira, M. Multiobjective optimization of a quadruped robot locomotion using a genetic algorithm. In Proc. Soft Computing in Industrial Applications 427–436 (Springer, 2011).
Kullander, K. et al. Role of epha4 and ephrinb3 in local neuronal circuits that control walking. Science 299, 1889–1892 (2003).
Article Google Scholar
Biewener, A. A. Animal Locomotion 1st edn (Oxford Univ. Press, 2003).
Büschges, A. Sensory control and organization of neural networks mediating coordination of multisegmental organs for locomotion. J. Neurophysiol. 93, 1127–1135 (2005).
Article Google Scholar
Azayev, T. & Zimmerman, K. Blind hexapod locomotion in complex terrain with gait adaptation using deep reinforcement learning and classification. J. Intell. Robot. Syst. 99, 659–671 (2020).
Article Google Scholar
Delcomyn, F. Walking robots and the central and peripheral control of locomotion in insects. Auton. Robot. 7, 259–270 (1999).
Article Google Scholar
Samek, W. & Müller, K.-R. Towards Explainable Artificial Intelligence 5–22 (Springer, 2019).
Thor, M., Larsen, J. C. & Manoonpong, P. MORF—modular robot framework. In Proc. 2nd International Youth Conference of Bionic Engineering 21–23 (Frontiers, 2018).
Yadav, R. N., Kalra, P. K. & John, J. On the use of multiplicative neuron in feedforward neural networks. Int. J. Simul. Model. 26, 331–336 (2006).
Article Google Scholar
Schmitt, M. On the complexity of computing and learning with multiplicative neural networks. Neural Comput. 14, 241–301 (2002).
Article Google Scholar
Koch, C. & Poggio, T. in Single Neuron Computation, Neural Networks: Foundations to Applications (eds McKenna, T. et al.) Ch. 12 (Academic Press, 1992).
Hashlamon, I. & Erbatur, K. Joint sensor fault detection and recovery based on virtual sensor for walking legged robots. In Proc. IEEE 23rd International Symposium on Industrial Electronics (ISIE) 1210–1214 (IEEE, 2014).
Perla, R., Mukhopadhyay, S. & Samanta, A. N. Sensor fault detection and isolation using artificial neural networks. In Proc. IEEE Region 10 Conference TENCON Vol. 4, 676–679 (IEEE, 2004).
Christensen, A. L., O’Grady, R., Birattari, M. & Dorigo, M. Fault detection in autonomous robots based on fault injection and learning. Auton. Robot. 24, 49–67 (2008).
Article Google Scholar
Patle, B., Babu L, G., Pandey, A., Parhi, D. & Jagadeesh, A. A review: on path planning strategies for navigation of mobile robot. Def. Technol. 15, 582–606 (2019).
Google Scholar
Goldschmidt, D., Manoonpong, P. & Dasgupta, S. A neurocomputational model of goal-directed navigation in insect-inspired artificial agents. Front. Neurorobot. 11, 20 (2017).
Article Google Scholar
Brooks, R. A robust layered control system for a mobile robot. IEEE Robot. Autom. Lett. 2, 14–23 (1986).
Article Google Scholar
Manoonpong, P., Geng, T., Kulvicius, T., Porr, B. & Wörgötter, F. Adaptive, fast walking in a biped robot under neuronal control and learning. PLoS Comput. Biol. 3, e134 (2007).
Article Google Scholar
Jakobi, N. Evolutionary robotics and the radical envelope-of-noise hypothesis. Adapt. Behav. 6, 325–368 (1997).
Article Google Scholar
Demin, V. & Nekhaev, D. Recurrent spiking neural network learning based on a competitive maximization of neuronal activity. Front. Neurorobot. 12, 79 (2018).
Google Scholar
Pfeiffer, M. & Pfeil, T. Deep learning with spiking neurons: opportunities and challenges. Front. Neurorobot. 12, 774 (2018).
Article Google Scholar
Strohmer, B., Manoonpong, P. & Larsen, L. B. Flexible spiking CPGs for online manipulation during hexapod walking. Front. Neurorobot. 14, 41 (2020).
Article Google Scholar
Gutierrez-Galan, D., Dominguez-Morales, J. P., Perez-Peña, F., Jimenez-Fernandez, A. & Linares-Barranco, A. Neuropod: a real-time neuromorphic spiking CPG applied to robotics. Neurocomputing 381, 10–19 (2020).
Article Google Scholar
Espinal, A. et al. Design of spiking central pattern generators for multiple locomotion gaits in hexapod robots by christiansen grammar evolution. Front. Neurorobot. 10, 6 (2016).
Article Google Scholar
Donati, E., Indiveri, G. & Stefanini, C. A novel spiking CPG-based implementation system to control a lamprey robot. In Proc. IEEE International Conference on Biomedical Robotics and Biomechatronics (BioRob) 1364–1364 (IEEE, 2016).
Polykretis, I., Tang, G. & Michmizos, K. P. An astrocyte-modulated neuromorphic central pattern generator for hexapod robot locomotion on intel’s loihi. In Proc. ICONS: International Conference on Neuromorphic Systems 1–9 (Association for Computing Machinery, 2020).
Aoi, S., Manoonpong, P., Ambe, Y., Matsuno, F. & Wörgötter, F. Adaptive control strategies for interlimb coordination in legged robots: a review. Front. Neurorobot. 11, 39 (2017).
Article Google Scholar
Nachstedt, T., Tetzlaff, C. & Manoonpong, P. Fast dynamical coupling enhances frequency adaptation of oscillators for robotic locomotion control. Front. Neurorobot. 11, 14 (2017).
Article Google Scholar
Pasemann, F., Hild, M. & Zahedi, K. SO(2)-networks as neural oscillators. In Proc. Computational Methods in Neural Modeling 144–151 (Springer, 2003).
Pasemann, F. & Stollenwerk, N. Attractor switching by neural control of chaotic neurodynamics. Netw. Comput. Neural Syst. 9, 549–561 (1998).
Article Google Scholar
Pasemann, F. Complex dynamics and the structure of small neural networks. Netw. Comput. Neural Syst. 13, 195–216 (2002).
Article Google Scholar
Steingrube, S., Timme, M., Wörgötter, F. & Manoonpong, P. Self-organized adaptation of a simple neural circuit enables complex robot behaviour. Nat. Phys. 6, 224–230 (2010).
Article Google Scholar
Thor, M. & Manoonpong, P. Error-based learning mechanism for fast online adaptation in robot motor control. IEEE Trans. Neural Netw. Learn. Syst. 31, 2042–2051 (2019).
Article Google Scholar
Manoonpong, P., Parlitz, U. & Wörgötter, F. Neural control and adaptive neural forward models for insect-like, energy-efficient, and adaptable locomotion of walking machines. Front. Neural Circuits 7, 12 (2013).
Article Google Scholar
Broomhead, D. & Lowe, D. Radial basis functions, multi-variable functional interpolation and adaptive networks. R. Signals Radar Estab. 4148, 801–849 (1988).
Google Scholar
Stulp, F. & Sigaud, O. Policy improvement: between black-box optimization and episodic reinforcement learning. In Proc. Journées Francophones Planification, Décision, et Apprentissage Pour la Conduite de Systémes hal-00922133 (2013).
Theodorou, E., Buchli, J. & Schaal, S. A generalized path integral control approach to reinforcement learning. J. Mach. Learn. Res. 11, 3137–3181 (2010).
MathSciNet MATH Google Scholar
Chatterjee, S. et al. Learning and chaining of motor primitives for goal-directed locomotion of a snake-like robot with screw-drive units. Int. J. Adv. Robot. Syst. 12, 176 (2015).
Google Scholar
Thor, M. Mathiasthor/cpg-rbfn-framework: first release. Zenodo https://doi.org/10.5281/zenodo.5524494 (2021).

Download references

Acknowledgements

We thank the members of the SDU Biorobotics group for their technical support and helpful discussions. We also thank A. L. Christensen for fruitful discussions and detailed feedback. This work was supported in part by the Horizon 2020 Framework Programme (FETPROACT-01-2016 FET Proactive: Emerging Themes and Communities) under grant 732266 (Plan4Act) (P.M., project workpackage principal investigator) and in part by a start-up grant on Bio-inspired Robotics from the Vidyasirimedhi Institute of Science and Technology (VISTEC) (P.M., project principal investigator).

Author information

Authors and Affiliations

Embodied AI and Neurorobotics Laboratory, SDU Biorobotics, The Mærsk Mc-Kinney Møller Institute, The University of Southern Denmark, Odense, Denmark
Mathias Thor & Poramate Manoonpong
Bio-Inspired Robotics and Neural Engineering Laboratory, School of Information Science and Technology, Vidyasirimedhi Institute of Science and Technology, Rayong, Thailand
Poramate Manoonpong

Authors

Mathias Thor
View author publications
You can also search for this author in PubMed Google Scholar
Poramate Manoonpong
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.T. contributed to the modular neural control structure, implementation, experiments, data analysis and the manuscript. P.M. contributed to the embodied neural control structure, data analysis and the manuscript.

Corresponding author

Correspondence to Mathias Thor.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Machine Intelligence thanks Luca Patané, Fernando Perez-Peña and Ren Qinyuan for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Thor, M., Manoonpong, P. Versatile modular neural locomotion control with fast learning. Nat Mach Intell 4, 169–179 (2022). https://doi.org/10.1038/s42256-022-00444-0

Download citation

Received: 15 June 2021
Accepted: 11 January 2022
Published: 14 February 2022
Issue Date: February 2022
DOI: https://doi.org/10.1038/s42256-022-00444-0

This article is cited by

Viability leads to the emergence of gait transitions in learning agile quadrupedal locomotion on challenging terrains
- Milad Shafiee
- Guillaume Bellegarda
- Auke Ijspeert
Nature Communications (2024)
Hybrid hierarchical learning for solving complex sequential tasks using the robotic manipulation network ROMAN
- Eleftherios Triantafyllidis
- Fernando Acero
- Zhibin Li
Nature Machine Intelligence (2023)
Bionic Muscle Control with Adaptive Stiffness for Bionic Parallel Mechanism
- Yaguang Zhu
- Ruyue Li
- Zhipeng Song
Journal of Bionic Engineering (2023)