Contrastive Learning as Goal-Conditioned Reinforcement Learning

Eysenbach, Benjamin; Zhang, Tianjun; Salakhutdinov, Ruslan; Levine, Sergey

Computer Science > Machine Learning

arXiv:2206.07568 (cs)

[Submitted on 15 Jun 2022 (v1), last revised 17 Feb 2023 (this version, v2)]

Title:Contrastive Learning as Goal-Conditioned Reinforcement Learning

Authors:Benjamin Eysenbach, Tianjun Zhang, Ruslan Salakhutdinov, Sergey Levine

View PDF

Abstract:In reinforcement learning (RL), it is easier to solve a task if given a good representation. While deep RL should automatically acquire such good representations, prior work often finds that learning representations in an end-to-end fashion is unstable and instead equip RL algorithms with additional representation learning parts (e.g., auxiliary losses, data augmentation). How can we design RL algorithms that directly acquire good representations? In this paper, instead of adding representation learning parts to an existing RL algorithm, we show (contrastive) representation learning methods can be cast as RL algorithms in their own right. To do this, we build upon prior work and apply contrastive representation learning to action-labeled trajectories, in such a way that the (inner product of) learned representations exactly corresponds to a goal-conditioned value function. We use this idea to reinterpret a prior RL method as performing contrastive learning, and then use the idea to propose a much simpler method that achieves similar performance. Across a range of goal-conditioned RL tasks, we demonstrate that contrastive RL methods achieve higher success rates than prior non-contrastive methods, including in the offline RL setting. We also show that contrastive RL outperforms prior methods on image-based tasks, without using data augmentation or auxiliary objectives.

Comments:	NeurIPS 2022. Code is available on the website: this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2206.07568 [cs.LG]
	(or arXiv:2206.07568v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.07568

Submission history

From: Benjamin Eysenbach [view email]
[v1] Wed, 15 Jun 2022 14:34:15 UTC (15,111 KB)
[v2] Fri, 17 Feb 2023 21:53:23 UTC (20,744 KB)

Computer Science > Machine Learning

Title:Contrastive Learning as Goal-Conditioned Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Contrastive Learning as Goal-Conditioned Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators