Meta-Value Learning: a General Framework for Learning with Learning Awareness

Cooijmans, Tim; Aghajohari, Milad; Courville, Aaron

Computer Science > Machine Learning

arXiv:2307.08863 (cs)

[Submitted on 17 Jul 2023 (v1), last revised 11 Dec 2023 (this version, v3)]

Title:Meta-Value Learning: a General Framework for Learning with Learning Awareness

Authors:Tim Cooijmans, Milad Aghajohari, Aaron Courville

View PDF HTML (experimental)

Abstract:Gradient-based learning in multi-agent systems is difficult because the gradient derives from a first-order model which does not account for the interaction between agents' learning processes. LOLA (arXiv:1709.04326) accounts for this by differentiating through one step of optimization. We propose to judge joint policies by their long-term prospects as measured by the meta-value, a discounted sum over the returns of future optimization iterates. We apply a form of Q-learning to the meta-game of optimization, in a way that avoids the need to explicitly represent the continuous action space of policy updates. The resulting method, MeVa, is consistent and far-sighted, and does not require REINFORCE estimators. We analyze the behavior of our method on a toy game and compare to prior work on repeated matrix games.

Subjects:	Machine Learning (cs.LG); Multiagent Systems (cs.MA)
Cite as:	arXiv:2307.08863 [cs.LG]
	(or arXiv:2307.08863v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2307.08863

Submission history

From: Tim Cooijmans [view email]
[v1] Mon, 17 Jul 2023 21:40:57 UTC (571 KB)
[v2] Mon, 4 Sep 2023 13:46:41 UTC (1,803 KB)
[v3] Mon, 11 Dec 2023 16:52:51 UTC (3,039 KB)

Computer Science > Machine Learning

Title:Meta-Value Learning: a General Framework for Learning with Learning Awareness

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Meta-Value Learning: a General Framework for Learning with Learning Awareness

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators