Skip to main content

Pyramid Representations of the Set of Actions in Reinforcement Learning

  • Conference paper
Bioinspired Computation in Artificial Systems (IWINAC 2015)

Abstract

Future robot systems will perform increasingly complex tasks in decreasingly well-structured and known environments. Robots will need to adapt their hardware and software, first only to foreseen, but ultimately to more complex changes of the environment. In this paper we describe a learning strategy based on reinforcement which allows fast robot learning from scratch using only its interaction with the environment, even when the reward is provided by a human observer and therefore is highly non-deterministic and noisy. To get this our proposal uses a novel representation of the action space together with an ensemble of learners able to forecast the time interval before a robot failure

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Quintia, P., Iglesias, R., Rodriguez, M.A., Regueiro, C.V.: “Simultaneus learning of perception and action in mobile robots2. Robotics and Autonomous Systems 58(12), 1306–1315 (2010)

    Article  Google Scholar 

  2. Lazebnik, S., Schmid, C., Ponce, J.: Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories. In: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), pp. 2169–2178 (2006)

    Google Scholar 

  3. Kristo, K., Chua, C.S.: Image representation for object recognition: utilizing overlapping windows in Spatial Pyramid Matching. In: 20th IEEE International Conference on Image Processing (ICIP) (2013)

    Google Scholar 

  4. Quintia Vidal, P., Iglesias Rodriguez, R., Rodriguez Gonzalez, M.A., Vazquez Regueiro, C.: Learning on real robots from experience and simple user feedback. Journal of Physical Agents 7(1) (2013)

    Google Scholar 

  5. Carpenter, C.A., Grossberg, S., Rosen, D.B.: Fuzzy art: Fast stable learning and categorization of analog pattern by an adaptive resonance system. Neural Network 4(6), 759–771 (1991)

    Article  Google Scholar 

  6. Barto, A.G.: Reinforcement learning: An introduction. MIT press (1998)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. Iglesias .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Iglesias, R., Alvarez-Santos, V., Rodriguez, M.A., Santos-Saavedra, D., Regueiro, C.V., Pardo, X.M. (2015). Pyramid Representations of the Set of Actions in Reinforcement Learning. In: Ferrández Vicente, J., Álvarez-Sánchez, J., de la Paz López, F., Toledo-Moreo, F., Adeli, H. (eds) Bioinspired Computation in Artificial Systems. IWINAC 2015. Lecture Notes in Computer Science(), vol 9108. Springer, Cham. https://doi.org/10.1007/978-3-319-18833-1_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-18833-1_22

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-18832-4

  • Online ISBN: 978-3-319-18833-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics