Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models

Lou, Xingzhou; Zhang, Junge; Wang, Ziyan; Huang, Kaiqi; Du, Yali

Computer Science > Machine Learning

arXiv:2401.07553 (cs)

[Submitted on 15 Jan 2024 (v1), last revised 19 Apr 2024 (this version, v2)]

Title:Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models

Authors:Xingzhou Lou, Junge Zhang, Ziyan Wang, Kaiqi Huang, Yali Du

View PDF HTML (experimental)

Abstract:Safe reinforcement learning (RL) agents accomplish given tasks while adhering to specific constraints. Employing constraints expressed via easily-understandable human language offers considerable potential for real-world applications due to its accessibility and non-reliance on domain expertise. Previous safe RL methods with natural language constraints typically adopt a recurrent neural network, which leads to limited capabilities when dealing with various forms of human language input. Furthermore, these methods often require a ground-truth cost function, necessitating domain expertise for the conversion of language constraints into a well-defined cost function that determines constraint violation. To address these issues, we proposes to use pre-trained language models (LM) to facilitate RL agents' comprehension of natural language constraints and allow them to infer costs for safe policy learning. Through the use of pre-trained LMs and the elimination of the need for a ground-truth cost, our method enhances safe policy learning under a diverse set of human-derived free-form natural language constraints. Experiments on grid-world navigation and robot control show that the proposed method can achieve strong performance while adhering to given constraints. The usage of pre-trained LMs allows our method to comprehend complicated constraints and learn safe policies without the need for ground-truth cost at any stage of training or evaluation. Extensive ablation studies are conducted to demonstrate the efficacy of each part of our method.

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2401.07553 [cs.LG]
	(or arXiv:2401.07553v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2401.07553

Submission history

From: Xingzhou Lou [view email]
[v1] Mon, 15 Jan 2024 09:37:03 UTC (3,770 KB)
[v2] Fri, 19 Apr 2024 05:48:11 UTC (3,658 KB)

Computer Science > Machine Learning

Title:Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators