Adaptive control of a looper-like robot based on the CPG-actor-critic method

Makino, Kenji; Nakamura, Yutaka; Shibata, Tomohiro; Ishii, Shin

doi:10.1007/s10015-007-0453-9

Original Article
Published: 01 April 2008

Artificial Life and Robotics, volume 12, pages 129–132 (2008)

Kenji Makino¹, Yutaka Nakamura², Tomohiro Shibata¹ & Shin Ishii¹


Abstract

Adaptability to the environment is crucial for mobile robots because their circumstances, including the robot's own body, may change. A robot with many degrees of freedom has the potential to adapt to such changes, but designing a good controller for it is difficult. We previously proposed a reinforcement learning (RL) method called the CPG actor-critic method and applied it, in computer simulations, to the automatic acquisition of vermicular locomotion by a looper-like robot. In this study, we developed a looper-like robot and applied our RL method to its control. Experimental results demonstrate fast acquisition of a vermicular forward motion, supporting the applicability of our method to real robots.

Author information

Authors and Affiliations

Graduate School of Information Science, Nara Institute of Science and Technology (NAIST), Nara, Japan
Kenji Makino, Tomohiro Shibata & Shin Ishii
Graduate School of Engineering, Osaka University, 2-1 Yamadaoka, Suita, 565-0871, Japan
Yutaka Nakamura

Corresponding author

Correspondence to Yutaka Nakamura.

About this article

Cite this article

Makino, K., Nakamura, Y., Shibata, T. et al. Adaptive control of a looper-like robot based on the CPG-actor-critic method. Artif Life Robotics 12, 129–132 (2008). https://doi.org/10.1007/s10015-007-0453-9

Received: 16 May 2007
Accepted: 16 May 2007
Published: 01 April 2008
Issue Date: March 2008
DOI: https://doi.org/10.1007/s10015-007-0453-9
