References
Barto, A.G., Bradtke, S.J. & Singh, S.P. (1991). Real-time learning and control using asynchronous dynamic programming (Technical Report 91-57). Amherst, MA: University of Massachusetts, Computer Science Department.
Barto, A.G. & Sutton, R.S. (1981). Landmark learning: An illustration of associative search. Biological Cybernetics, 42, 1–8.
Barto, A.G., Sutton, R.S. & Anderson, C.W. (1983). Neuronlike elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man, and Cybernetics, SMC-13, 834–846.
Barto, A.G., Sutton, R.S. & Brouwer, P.S. (1981). Associative search network: A reinforcement learning associative memory. Biological Cybernetics, 40, 201–211.
Booker, L.B. (1988). Classifier systems that learn world models. Machine Learning, 3, 161–192.
Grefenstette, J.J., Ramsey, C.L. & Schultz, A.C. (1990). Learning sequential decision rules using simulation models and competition. Machine Learning, 5, 355–382.
Hampson, S.E. (1983). A neural model of adaptive behavior. Ph.D. dissertation, Department of Information and Computer Science, University of California, Irvine (Technical Report #213). A revised edition appeared as Connectionist Problem Solving, Boston: Birkhäuser, 1990.
Holland, J.H. (1975). Adaptation in natural and artificial systems. Ann Arbor, MI: University of Michigan Press.
Holland, J.H. (1986). Escaping brittleness: The possibilities of general-purpose learning algorithms applied to parallel rule-based systems. In R.S. Michalski, J.G. Carbonell & T.M. Mitchell (Eds.), Machine learning: An artificial intelligence approach, Volume II (pp. 593–623). Los Altos, CA: Morgan Kaufmann.
Kaelbling, L.P. (1990). Learning in embedded systems. Ph.D. dissertation, Computer Science Department, Stanford University.
Mahadevan, S. & Connell, J. (1990). Automatic programming of behavior-based robots using reinforcement learning. IBM technical report. To appear in Artificial Intelligence.
Minsky, M.L. (1961). Steps toward artificial intelligence. Proceedings of the IRE, 49, 8–30. Reprinted in E.A. Feigenbaum & J. Feldman (Eds.), Computers and Thought (pp. 406–450). New York: McGraw-Hill, 1963.
Narendra, K.S. & Thathachar, M.A.L. (1974). Learning automata—a survey. IEEE Transactions on Systems, Man, and Cybernetics, 4, 323–334. (Or see their textbook, Learning Automata: An Introduction, Englewood Cliffs, NJ: Prentice Hall, 1989.)
Samuel, A.L. (1959). Some studies in machine learning using the game of checkers. IBM Journal of Research and Development, 3, 210–229. Reprinted in E.A. Feigenbaum & J. Feldman (Eds.), Computers and Thought (pp. 71–105). New York: McGraw-Hill, 1963.
Waltz, M.D. & Fu, K.S. (1965). A heuristic approach to reinforcement learning control systems. IEEE Transactions on Automatic Control, AC-10, 390–398.
Watkins, C.J.C.H. (1989). Learning from delayed rewards. Ph.D. dissertation, Psychology Department, Cambridge University.
Werbos, P.J. (1987). Building and understanding adaptive systems: A statistical/numerical approach to factory automation and brain research. IEEE Transactions on Systems, Man, and Cybernetics, Jan–Feb.
Whitehead, S.D. & Ballard, D.H. (1991). Learning to perceive and act by trial and error. Machine Learning, 7, 45–84.
Sutton, R.S. Introduction: The challenge of reinforcement learning. Mach Learn 8, 225–227 (1992). https://doi.org/10.1007/BF00992695