X-MEN: Guaranteed XOR-Maximum Entropy Constrained Inverse Reinforcement Learning

Ding, Fan; Xue, Yeiang

Computer Science > Machine Learning

arXiv:2203.11842 (cs)

[Submitted on 22 Mar 2022]

Title:X-MEN: Guaranteed XOR-Maximum Entropy Constrained Inverse Reinforcement Learning

Authors:Fan Ding, Yeiang Xue

View PDF

Abstract:Inverse Reinforcement Learning (IRL) is a powerful way of learning from demonstrations. In this paper, we address IRL problems with the availability of prior knowledge that optimal policies will never violate certain constraints. Conventional approaches ignoring these constraints need many demonstrations to converge. We propose XOR-Maximum Entropy Constrained Inverse Reinforcement Learning (X-MEN), which is guaranteed to converge to the optimal policy in linear rate w.r.t. the number of learning iterations. X-MEN embeds XOR-sampling -- a provable sampling approach that transforms the #P complete sampling problem into queries to NP oracles -- into the framework of maximum entropy IRL. X-MEN also guarantees the learned policy will never generate trajectories that violate constraints. Empirical results in navigation demonstrate that X-MEN converges faster to the optimal policies compared to baseline approaches and always generates trajectories that satisfy multi-state combinatorial constraints.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2203.11842 [cs.LG]
	(or arXiv:2203.11842v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2203.11842

Submission history

From: Fan Ding [view email]
[v1] Tue, 22 Mar 2022 16:09:42 UTC (523 KB)

Computer Science > Machine Learning

Title:X-MEN: Guaranteed XOR-Maximum Entropy Constrained Inverse Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:X-MEN: Guaranteed XOR-Maximum Entropy Constrained Inverse Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators