
Goal-Conditioned Variational Autoencoder Trajectory Primitives with Continuous and Discrete Latent Codes

  • Original Research
  • Published in: SN Computer Science

Abstract

Imitation learning is an intuitive approach for teaching motion to robotic systems. Although previous studies have proposed various methods for modeling demonstrated movement primitives, a limitation of existing methods is that the shape of the trajectory is encoded in a high-dimensional space. The high dimensionality of the trajectory representation can be a bottleneck in subsequent processes, such as planning a sequence of primitive motions. We address this problem by learning a latent space of robot trajectories. If the latent variables of the trajectories can be learned, they can be used to tune a trajectory in an intuitive manner, even when the user is not an expert. We propose a framework for modeling demonstrated trajectories with a neural network that learns a low-dimensional latent space. Our network structure is built on the variational autoencoder (VAE) with discrete and continuous latent variables. We extend the structure of the existing VAE so that the decoder is conditioned on the goal position of the trajectory, allowing generalization to different goal positions. Although the inference performed by the VAE is not exact, the positioning error at the generalized goal position can be reduced to less than 1 mm by incorporating a projection onto the solution space. To cope with the need for massive training data, we use a trajectory augmentation technique inspired by the data augmentation commonly used in the computer vision community. In the proposed framework, the latent variables that encode multiple types of trajectories are learned in an unsupervised manner, whereas existing methods usually require label information to model diverse behaviors. The learned decoder can be used as a motion planner in which the user specifies the goal position and the trajectory type by setting the latent variables. The experimental results show that our neural network can be trained with a limited number of demonstrated trajectories and that interpretable latent representations can be learned.
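Two of the ideas sketched above, trajectory augmentation and the projection onto the solution space, can be illustrated with a minimal numpy example. This is our own simplified sketch, not the authors' implementation: the function names, the rigid-translation augmentation, and the linear blending schedule for the endpoint correction are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

def augment(traj, n=10, scale=0.05):
    """Create perturbed copies of a demonstrated trajectory by rigidly
    translating the whole path toward randomly shifted goal positions.
    traj: (T, d) array of waypoints."""
    copies = []
    for _ in range(n):
        shift = rng.normal(0.0, scale, size=traj.shape[1])
        copies.append(traj + shift)  # same shape, shifted endpoint
    return copies

def project_to_goal(traj, goal):
    """Project a generated trajectory so its final waypoint lands
    exactly on the requested goal: blend in the endpoint error with a
    weight growing linearly from 0 at the start to 1 at the end, so the
    start point is unchanged and the trajectory shape is preserved."""
    err = goal - traj[-1]
    w = np.linspace(0.0, 1.0, len(traj))[:, None]
    return traj + w * err
```

In this sketch, a decoder's output trajectory (however it is generated) can be passed through `project_to_goal` to eliminate the residual positioning error at the goal, which is the role the projection step plays in the framework described above.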



Notes

  1. We customized the color to show the motion more clearly.

  2. Available at https://github.com/Schlumberger/joint-vae.

  3. Please note that the frames shown in the figures are not synchronized, due to a limitation of our implementation.


Funding

T.O. was supported by JSPS KAKENHI Grant Number 19K20370, and S.I. was supported by JSPS KAKENHI Grant Numbers 18H01410 and 19K22875.

Author information

Corresponding author

Correspondence to Takayuki Osa.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article


Cite this article

Osa, T., Ikemoto, S. Goal-Conditioned Variational Autoencoder Trajectory Primitives with Continuous and Discrete Latent Codes. SN COMPUT. SCI. 1, 303 (2020). https://doi.org/10.1007/s42979-020-00324-7
