Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions

Jodogne, Sébastien; Piater, Justus H.

doi:10.1007/11871842_24

Sébastien Jodogne²¹ &
Justus H. Piater²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4212))

Included in the following conference series:

European Conference on Machine Learning

5482 Accesses
2 Citations

Abstract

We target the problem of closed-loop learning of control policies that map visual percepts to continuous actions. Our algorithm, called Reinforcement Learning of Joint Classes (RLJC), adaptively discretizes the joint space of visual percepts and continuous actions. In a sequence of attempts to remove perceptual aliasing, it incrementally builds a decision tree that applies tests either in the input perceptual space or in the output action space. The leaves of such a decision tree induce a piecewise constant, optimal state-action value function, which is computed through a reinforcement learning algorithm that uses the tree as a function approximator. The optimal policy is then derived by selecting the action that, given a percept, leads to the leaf that maximizes the value function. Our approach is quite general and applies also to learning mappings from continuous percepts to continuous actions. A simulated visual navigation problem illustrates the applicability of RLJC.

Download to read the full chapter text

Chapter PDF

Decision-tree analysis of control strategies

Article 15 October 2014

Learning Long-Horizon Robot Exploration Strategies for Multi-object Search in Continuous Action Spaces

Self-directed Lifelong Learning for Robot Vision

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Bertsekas, D., Tsitsiklis, J.: Neuro-Dynamic Programming. Athena Scient. (1996)
Google Scholar
Sutton, R., Barto, A.: Reinforcement Learning, an Introduction. MIT Press, Cambridge (1998)
Google Scholar
Gross, H.M., Stephan, V., Krabbes, M.: A neural field approach to topological reinforcement learning in continuous action spaces. In: Proc. of the IEEE World Congress on Computational Intelligence, vol. 3, pp. 1992–1997 (1998)
Google Scholar
Santamaria, J., Sutton, R., Ram, A.: Experiments with reinforcement learning in problems with continuous state and action spaces. Adaptive Behavior 6(2), 163–218 (1998)
Article Google Scholar
Gaskett, C., Wettergreen, D., Zelinsky, A.: Q-learning in continuous state and action spaces. In: Australian Joint Conf. on Artificial Intelligence, pp. 417–428 (1999)
Google Scholar
Jodogne, S., Piater, J.: Interactive learning of mappings from visual percepts to actions. In: De Raedt, L., Wrobel, S. (eds.) Proc. of the 22nd Intern. Conf. on Machine Learning (ICML), Bonn, Germany, pp. 393–400. ACM Press, New York (2005)
Chapter Google Scholar
Monson, C., Wingate, D., Seppi, K., Peterson, T.: Variable resolution discretization in the joint space. In: Intern. Conf. on Machine Learning and Applications (2004)
Google Scholar
Munos, R., Moore, A.: Variable resolution discretization in optimal control. Machine Learning 49, 291–323 (2002)
Article MATH Google Scholar
Whitehead, S., Ballard, D.: Learning to perceive and act by trial and error. Machine Learning 7, 45–83 (1991)
Google Scholar
Breiman, L., Friedman, J., Stone, C.: Classification and Regression Trees. Wadsworth Intern. Group (1984)
Google Scholar
Schmid, C., Mohr, R., Bauckhage, C.: Evaluation of interest point detectors. Intern. Journal of Computer Vision 37(2), 151–172 (2000)
Article MATH Google Scholar
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. In: Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition, Madison, WI, USA, vol. 2, pp. 257–263 (2003)
Google Scholar
Ernst, D., Geurts, P., Wehenkel, L.: Tree-based batch mode reinforcement learning. Journal of Machine Learning Research 6, 503–556 (2005)
MathSciNet Google Scholar
Lowe, D.: Distinctive image features from scale-invariant keypoints. Intern. Journal of Computer Vision 60(2), 91–110 (2004)
Article Google Scholar
Coelho, J., Piater, J., Grupen, R.: Developing haptic and visual perceptual categories for reaching and grasping with a humanoid robot. Robotics and Autonomous Systems, special issue on Humanoid Robots 37(2-3), 195–218 (2001)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Montefiore Institute (B28), University of Liège, B-4000, Liège, Belgium
Sébastien Jodogne & Justus H. Piater

Authors

Sébastien Jodogne
View author publications
You can also search for this author in PubMed Google Scholar
Justus H. Piater
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Knowledge Engineering Group, Technische Universität Darmstadt,
Johannes Fürnkranz
Max Planck Institute for Computer Science, Saarbrücken, Germany
Tobias Scheffer
Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Germany
Myra Spiliopoulou

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jodogne, S., Piater, J.H. (2006). Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds) Machine Learning: ECML 2006. ECML 2006. Lecture Notes in Computer Science(), vol 4212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11871842_24

Download citation

DOI: https://doi.org/10.1007/11871842_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45375-8
Online ISBN: 978-3-540-46056-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions

Abstract

Chapter PDF

Similar content being viewed by others

Decision-tree analysis of control strategies

Learning Long-Horizon Robot Exploration Strategies for Multi-object Search in Continuous Action Spaces

Self-directed Lifelong Learning for Robot Vision

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions

Abstract

Chapter PDF

Similar content being viewed by others

Decision-tree analysis of control strategies

Learning Long-Horizon Robot Exploration Strategies for Multi-object Search in Continuous Action Spaces

Self-directed Lifelong Learning for Robot Vision

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation