Abstract
The solutions of most nonlinear optimal control problems are given in the form of open-loop optimal control which is computed from a given fixed initial condition. Optimal feedback control can in principle be obtained by solving the corresponding Hamilton-Jacobi-Bellman dynamic programming equation, though in general this is a difficult task. We propose a practical and effective alternative for constructing an approximate optimal feedback controller in the form of a feedforward neural network, and we justify this choice by several reasons. The controller is capable of approximately minimizing an arbitrary performance index for a nonlinear dynamical system for initial conditions arising from a nontrivial bounded subset of the state space. A direct training algorithm is proposed and several illustrative examples are given.
Similar content being viewed by others
References
Bryson, A. E., andHo, Y. C.,Applied Optimal Control, Hemisphere Publishing Company, Washington DC, 1975.
Goh, C. J.,On the Nonlinear Optimal Regulator, Automatica, Vol. 29, pp. 751–756, 1993.
Biswas, S. K., andAhmed, N. U.,Optimal Feedback Control of Power Systems Governed by Nonlinear Dynamics, Optimal Control Applications and Methods, Vol. 7, pp. 289–303, 1986.
Goh, C. J., Edwards, N. J., andZomaya, A. Y.,Feedback Control of Minimum Time Optimal Control Problems Using Neural Networks, Optimal Control Applications and Methods, Vol. 14, pp. 1–16, 1993.
Nguyen, D. H., andWidrow, B.,Neural Networks for Self Learning Control Systems, IEEE Control Systems Magazine, pp. 18–23, April 1990.
Goh, C. J., andEdwards, N. J.,Synthesis of Discrete-time Optimal Feedback Controller: The Neural Network Approach, International Journal of Systems Science, Vol. 25, pp. 1235–1248, 1994.
Funahashi, K.,On the Approximate Realization of Continuous Mapping by Neural Networks, Neural Networks, Vol. 2, pp. 183–192, 1988.
Hornik, K., Stinchcombe, M., andWhite, H.,Multilayer Feedforward Networks Are Universal Approximators, Neural Networks, Vol. 2, pp. 359–366, 1989.
Hecht-Nielsen, R.,Neurocomputing, Addison-Wesley Publishing Company, Reading, Massachusetts, 1990.
Hertz, J., Krogh, A., andPalmer, R. G.,Introduction to the Theory of Neural Computation, Addison-Wesley Publishing Company, Reading, Massachusetts, 1991.
Narendra, K. S., andParthasarathy, K.,Identification and Control of Dynamical Systems Using Neural Networks, IEEE Transactions on Neural Networks, Vol. 1, pp. 4–27, 1990.
Jennings, L. S., Fisher, M. E., Teo, K. L., andGoh, C. J.,MISER3: Solving Optimal Control Problems: An Update, Advances in Engineering Softwares, Vol. 13, pp. 190–196, 1991.
Teo, K. L., Goh, C. J., andWong, K. H.,A Unified Computational Approach to Optimal Control Problems, Harlow Longman Scientific and Technical, London, England, 1991.
Garrard, W. L., andJordan, J. M.,Design of Nonlinear Automatic Flight Control Systems, Automatica, Vol. 13, pp. 497–505, 1977.
Author information
Authors and Affiliations
Additional information
Communicated by M. Simaan
This research was carried out with the support of a grant from the Australian Research Council.
We thank the anonymous reviewers for their helpful comments.
Rights and permissions
About this article
Cite this article
Edwards, N.J., Goh, C.J. Direct training method for a continuous-time nonlinear optimal feedback controller. J Optim Theory Appl 84, 509–528 (1995). https://doi.org/10.1007/BF02191983
Issue Date:
DOI: https://doi.org/10.1007/BF02191983