Dynamic Programming with NAR Model versus Q-learning — Case Study

Chrobak, Jarosław; Pacut, Andrzej

doi:10.1007/978-3-7908-1902-1_113

Part of the book series: Advances in Soft Computing (AINSC, volume 19)

Abstract

Two approaches to control policy synthesis for unknown systems are investigated. The indirect approach adaptively identifies a neural network model in NAR (nonlinear autoregression) form and then applies dynamic programming to this model. The direct approach uses Q-learning with a lookup table. Both methods were applied to a stock portfolio optimization problem and tested on Warsaw Stock Exchange data.
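
To make the direct approach concrete, here is a minimal sketch of Q-learning with a lookup table. It is not the authors' implementation: the two-state toy market, the constants `PERSIST` and `MOVE`, the function names, and the learning parameters are all assumptions chosen for illustration, and no Warsaw Stock Exchange data is involved.

```python
import random

# Hypothetical toy market, not the paper's data: the state is the sign of the
# last price move (0 = down, 1 = up) and moves persist with probability 0.8.
N_STATES, N_ACTIONS = 2, 2      # actions: 0 = hold cash, 1 = hold stock
PERSIST = 0.8                   # assumed probability that the last move repeats
MOVE = 0.01                     # assumed size of one price move


def step(state, action, rng):
    """Sample the next state and the one-step reward of the toy market."""
    next_state = state if rng.random() < PERSIST else 1 - state
    price_return = MOVE if next_state == 1 else -MOVE
    reward = price_return if action == 1 else 0.0   # cash earns nothing
    return next_state, reward


def q_learning(n_steps=50_000, alpha=0.01, gamma=0.9, seed=0):
    """Tabular Q-learning with a uniformly random behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]   # the lookup table
    state = 0
    for _ in range(n_steps):
        action = rng.randrange(N_ACTIONS)              # explore uniformly
        next_state, reward = step(state, action, rng)
        # Q-learning target: reward plus discounted best value of next state.
        target = reward + gamma * max(q[next_state])
        q[state][action] += alpha * (target - q[state][action])
        state = next_state
    return q


q_table = q_learning()
# Greedy policy read off the table: best action index for each state.
policy = [max(range(N_ACTIONS), key=lambda a: q_table[s][a])
          for s in range(N_STATES)]
print(policy)
```

Under these assumptions the greedy policy should hold cash after a down move and hold stock after an up move, because price moves persist in the toy market. The indirect approach would instead fit a model of the price dynamics first and run dynamic programming on that model.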

Author information

Authors and Affiliations

Warsaw University of Technology, Nowowiejska 15/19, Warsaw, 00-665, Poland
Jarosław Chrobak & Andrzej Pacut

Editor information

Editors and Affiliations

Department of Computer Engineering, Technical University Częstochowa, Al. Armii Krajowej 36, 42-200, Częstochowa, Poland
Leszek Rutkowski
Systems Research Institute, Polish Academy of Sciences, ul. Newelska 6, 01-447, Warsaw, Poland
Janusz Kacprzyk

About this paper

Cite this paper

Chrobak, J., Pacut, A. (2003). Dynamic Programming with NAR Model versus Q-learning — Case Study. In: Rutkowski, L., Kacprzyk, J. (eds) Neural Networks and Soft Computing. Advances in Soft Computing, vol 19. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1902-1_113

DOI: https://doi.org/10.1007/978-3-7908-1902-1_113
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-0005-0
Online ISBN: 978-3-7908-1902-1
eBook Packages: Springer Book Archive