Direct training method for a continuous-time nonlinear optimal feedback controller

Edwards, N. J.; Goh, C. J.

doi:10.1007/BF02191983

Direct training method for a continuous-time nonlinear optimal feedback controller

Contributed Papers
Published: March 1995

Volume 84, pages 509–528, (1995)
Cite this article

Journal of Optimization Theory and Applications Aims and scope Submit manuscript

N. J. Edwards¹ &
C. J. Goh²

117 Accesses
16 Citations
Explore all metrics

Abstract

The solutions of most nonlinear optimal control problems are given in the form of open-loop optimal control which is computed from a given fixed initial condition. Optimal feedback control can in principle be obtained by solving the corresponding Hamilton-Jacobi-Bellman dynamic programming equation, though in general this is a difficult task. We propose a practical and effective alternative for constructing an approximate optimal feedback controller in the form of a feedforward neural network, and we justify this choice by several reasons. The controller is capable of approximately minimizing an arbitrary performance index for a nonlinear dynamical system for initial conditions arising from a nontrivial bounded subset of the state space. A direct training algorithm is proposed and several illustrative examples are given.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Bryson, A. E., andHo, Y. C.,Applied Optimal Control, Hemisphere Publishing Company, Washington DC, 1975.
Google Scholar
Goh, C. J.,On the Nonlinear Optimal Regulator, Automatica, Vol. 29, pp. 751–756, 1993.
Google Scholar
Biswas, S. K., andAhmed, N. U.,Optimal Feedback Control of Power Systems Governed by Nonlinear Dynamics, Optimal Control Applications and Methods, Vol. 7, pp. 289–303, 1986.
Google Scholar
Goh, C. J., Edwards, N. J., andZomaya, A. Y.,Feedback Control of Minimum Time Optimal Control Problems Using Neural Networks, Optimal Control Applications and Methods, Vol. 14, pp. 1–16, 1993.
Google Scholar
Nguyen, D. H., andWidrow, B.,Neural Networks for Self Learning Control Systems, IEEE Control Systems Magazine, pp. 18–23, April 1990.
Goh, C. J., andEdwards, N. J.,Synthesis of Discrete-time Optimal Feedback Controller: The Neural Network Approach, International Journal of Systems Science, Vol. 25, pp. 1235–1248, 1994.
Google Scholar
Funahashi, K.,On the Approximate Realization of Continuous Mapping by Neural Networks, Neural Networks, Vol. 2, pp. 183–192, 1988.
Google Scholar
Hornik, K., Stinchcombe, M., andWhite, H.,Multilayer Feedforward Networks Are Universal Approximators, Neural Networks, Vol. 2, pp. 359–366, 1989.
Google Scholar
Hecht-Nielsen, R.,Neurocomputing, Addison-Wesley Publishing Company, Reading, Massachusetts, 1990.
Google Scholar
Hertz, J., Krogh, A., andPalmer, R. G.,Introduction to the Theory of Neural Computation, Addison-Wesley Publishing Company, Reading, Massachusetts, 1991.
Google Scholar
Narendra, K. S., andParthasarathy, K.,Identification and Control of Dynamical Systems Using Neural Networks, IEEE Transactions on Neural Networks, Vol. 1, pp. 4–27, 1990.
Google Scholar
Jennings, L. S., Fisher, M. E., Teo, K. L., andGoh, C. J.,MISER3: Solving Optimal Control Problems: An Update, Advances in Engineering Softwares, Vol. 13, pp. 190–196, 1991.
Google Scholar
Teo, K. L., Goh, C. J., andWong, K. H.,A Unified Computational Approach to Optimal Control Problems, Harlow Longman Scientific and Technical, London, England, 1991.
Google Scholar
Garrard, W. L., andJordan, J. M.,Design of Nonlinear Automatic Flight Control Systems, Automatica, Vol. 13, pp. 497–505, 1977.
Google Scholar

Download references

Author information

Authors and Affiliations

Telecom Australia Research Laboratory, Clayton, Victoria, Australia
N. J. Edwards (Research Scientist)
Department of Mathematics, University of Western Australia, Nedlands, Western Australia, Australia
C. J. Goh (Associate Professor)

Authors

N. J. Edwards
View author publications
You can also search for this author in PubMed Google Scholar
C. J. Goh
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

Communicated by M. Simaan

This research was carried out with the support of a grant from the Australian Research Council.

We thank the anonymous reviewers for their helpful comments.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Edwards, N.J., Goh, C.J. Direct training method for a continuous-time nonlinear optimal feedback controller. J Optim Theory Appl 84, 509–528 (1995). https://doi.org/10.1007/BF02191983

Download citation

Issue Date: March 1995
DOI: https://doi.org/10.1007/BF02191983

Key Words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Direct training method for a continuous-time nonlinear optimal feedback controller

Abstract

Access this article

Similar content being viewed by others

Automated machine learning: past, present and future

Fundamentals of Artificial Neural Networks and Deep Learning

Multilayer Perceptron (MLP)

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Key Words

Navigation

Direct training method for a continuous-time nonlinear optimal feedback controller

Abstract

Access this article

Similar content being viewed by others

Automated machine learning: past, present and future

Fundamentals of Artificial Neural Networks and Deep Learning

Multilayer Perceptron (MLP)

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Key Words

Search

Navigation