Pyramid Representations of the Set of Actions in Reinforcement Learning

Iglesias, R.; Alvarez-Santos, V.; Rodriguez, M. A.; Santos-Saavedra, D.; Regueiro, C. V.; Pardo, X. M.

doi:10.1007/978-3-319-18833-1_22

R. Iglesias¹⁸,
V. Alvarez-Santos¹⁹,
M. A. Rodriguez¹⁸,
D. Santos-Saavedra¹⁸,
C. V. Regueiro²⁰ &
…
X. M. Pardo¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9108))

Included in the following conference series:

International Work-Conference on the Interplay Between Natural and Artificial Computation

1904 Accesses

Abstract

Future robot systems will perform increasingly complex tasks in decreasingly well-structured and known environments. Robots will need to adapt their hardware and software, first only to foreseen, but ultimately to more complex changes of the environment. In this paper we describe a learning strategy based on reinforcement which allows fast robot learning from scratch using only its interaction with the environment, even when the reward is provided by a human observer and therefore is highly non-deterministic and noisy. To get this our proposal uses a novel representation of the action space together with an ensemble of learners able to forecast the time interval before a robot failure

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Quintia, P., Iglesias, R., Rodriguez, M.A., Regueiro, C.V.: “Simultaneus learning of perception and action in mobile robots2. Robotics and Autonomous Systems 58(12), 1306–1315 (2010)
Article Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories. In: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), pp. 2169–2178 (2006)
Google Scholar
Kristo, K., Chua, C.S.: Image representation for object recognition: utilizing overlapping windows in Spatial Pyramid Matching. In: 20th IEEE International Conference on Image Processing (ICIP) (2013)
Google Scholar
Quintia Vidal, P., Iglesias Rodriguez, R., Rodriguez Gonzalez, M.A., Vazquez Regueiro, C.: Learning on real robots from experience and simple user feedback. Journal of Physical Agents 7(1) (2013)
Google Scholar
Carpenter, C.A., Grossberg, S., Rosen, D.B.: Fuzzy art: Fast stable learning and categorization of analog pattern by an adaptive resonance system. Neural Network 4(6), 759–771 (1991)
Article Google Scholar
Barto, A.G.: Reinforcement learning: An introduction. MIT press (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

CITIUS, Universidade de Santiago de Compostela, Santiago de Compostela, Spain
R. Iglesias, M. A. Rodriguez, D. Santos-Saavedra & X. M. Pardo
Situm Technologies S. L., Santiago de Compostela, Spain
V. Alvarez-Santos
Department of Electronics and Systems, Universidade da Coruña, A Coruña, Spain
C. V. Regueiro

Authors

R. Iglesias
View author publications
You can also search for this author in PubMed Google Scholar
V. Alvarez-Santos
View author publications
You can also search for this author in PubMed Google Scholar
M. A. Rodriguez
View author publications
You can also search for this author in PubMed Google Scholar
D. Santos-Saavedra
View author publications
You can also search for this author in PubMed Google Scholar
C. V. Regueiro
View author publications
You can also search for this author in PubMed Google Scholar
X. M. Pardo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R. Iglesias .

Editor information

Editors and Affiliations

Universidad Politécnica de Cartagena, Cartagena, Spain
José Manuel Ferrández Vicente
Universidad Nacional de Educación a Distancia, Madrid, Spain
José Ramón Álvarez-Sánchez
Universidad Nacional de Educación a Distancia, Madrid, Spain
Félix de la Paz López
Universidad Politécnica de Cartagena, Cartagena, Spain
Fco. Javier Toledo-Moreo
Dept. Civil & Environmental Engin. Geodetic Science, Ohio State University, Columbus, Ohio, USA
Hojjat Adeli

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Iglesias, R., Alvarez-Santos, V., Rodriguez, M.A., Santos-Saavedra, D., Regueiro, C.V., Pardo, X.M. (2015). Pyramid Representations of the Set of Actions in Reinforcement Learning. In: Ferrández Vicente, J., Álvarez-Sánchez, J., de la Paz López, F., Toledo-Moreo, F., Adeli, H. (eds) Bioinspired Computation in Artificial Systems. IWINAC 2015. Lecture Notes in Computer Science(), vol 9108. Springer, Cham. https://doi.org/10.1007/978-3-319-18833-1_22

Download citation

DOI: https://doi.org/10.1007/978-3-319-18833-1_22
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18832-4
Online ISBN: 978-3-319-18833-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics