Dynamical systems for predictive control of autonomous robots

Herrmann, J. Michael

doi:10.1007/s12064-001-0021-0

Published: December 2001

Volume 120, pages 241–252 (2001)

Theory in Biosciences

Summary

Regularities in the environment are accessible to an autonomous agent as reproducible relations between actions and perceptions and can be exploited by unsupervised learning. Our approach is based on the ability to make and verify predictions about the perceivable consequences of actions. It is implemented as a three-layer neural network that combines predictive perception, internal-state transitions and action selection into a loop that closes via the environment. In addition to minimizing prediction errors, network adaptation also aims to optimize the rate at which these errors are minimized, so that new behaviors are favored over already learned ones, for which the improvement in predictability would vanish. Previously learned behaviors are reactivated or continued if triggering stimuli are available and an externally or otherwise given reward overcompensates the decay of the learning rate. In the model, behavior learning and learning behavior are brought about by the same mechanism, namely the drive to continuously experience learning success. Behavior learning comprises the representation and storage of learned behaviors and, finally, their inhibition, so that further exploration of the environment is possible. Learning behavior, in contrast, detects the frontiers of the manifold of learned behaviors and provides estimates of the learnability of behaviors leading out of the field of expertise. The network module has been implemented in a Khepera miniature robot. We also consider hierarchical architectures consisting of several modules in one agent, as well as groups of several agents controlled by such networks.
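The selection principle described in the summary can be illustrated with a toy sketch. It is not the three-layer network of the article, whose details are not given on this page. It assumes, purely for illustration, that each behavior has a scalar linear predictor trained by a delta rule, that the environment returns a fixed gain times the action, and that the agent picks the behavior whose prediction error has most recently decreased the most, plus an optional external reward. All names (BehaviorModel, select_behavior, run, env_gains) are hypothetical.

```python
class BehaviorModel:
    """Hypothetical per-behavior predictor: perception ~ weight * action."""

    def __init__(self):
        self.weight = 0.0
        self.prev_error = None
        # Behaviors without a progress estimate look maximally promising.
        self.progress = float("inf")

    def update(self, action, perceived, lr=0.2):
        """Verify the prediction, adapt the weight, and track learning progress."""
        error = perceived - self.weight * action
        self.weight += lr * error * action  # delta rule (assumed, not from the article)
        abs_error = abs(error)
        if self.prev_error is not None:
            # Learning progress: how much the prediction error has dropped.
            self.progress = self.prev_error - abs_error
        self.prev_error = abs_error
        return abs_error


def select_behavior(models, rewards):
    """Pick the behavior with the largest learning progress plus reward."""
    scores = [m.progress + r for m, r in zip(models, rewards)]
    return max(range(len(models)), key=scores.__getitem__)


def run(env_gains, steps=200, rewards=None, action=1.0):
    """Close the loop through a toy environment with one gain per behavior."""
    models = [BehaviorModel() for _ in env_gains]
    rewards = rewards or [0.0] * len(env_gains)
    counts = [0] * len(env_gains)
    for _ in range(steps):
        k = select_behavior(models, rewards)
        perceived = env_gains[k] * action  # consequence of the chosen action
        models[k].update(action, perceived)
        counts[k] += 1
    return models, counts


if __name__ == "__main__":
    models, counts = run([1.0, 2.0, 0.5])
    print("visits per behavior:", counts)
    print("learned weights:", [round(m.weight, 3) for m in models])
```

In this sketch a behavior stops being selected once its error no longer shrinks faster than that of the alternatives, which loosely mirrors the inhibition of learned behaviors in favor of further exploration. Passing a large entry in rewards keeps the corresponding behavior active even after its learning progress has decayed.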

Author information

Authors and Affiliations

Max-Planck-Institut für Strömungsforschung, Bunsenstraße 10, D-37073, Göttingen, Germany
J. Michael Herrmann

Corresponding author

Correspondence to J. Michael Herrmann.

About this article

Cite this article

Herrmann, J.M. Dynamical systems for predictive control of autonomous robots. Theory Biosci. 120, 241–252 (2001). https://doi.org/10.1007/s12064-001-0021-0

Issue Date: December 2001
DOI: https://doi.org/10.1007/s12064-001-0021-0
