Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies

Pan, Liangming; Saxon, Michael; Xu, Wenda; Nathani, Deepak; Wang, Xinyi; Wang, William Yang

Computer Science > Computation and Language

arXiv:2308.03188 (cs)

[Submitted on 6 Aug 2023 (v1), last revised 30 Aug 2023 (this version, v2)]

Title:Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies

Authors:Liangming Pan, Michael Saxon, Wenda Xu, Deepak Nathani, Xinyi Wang, William Yang Wang

View PDF

Abstract:Large language models (LLMs) have demonstrated remarkable performance across a wide array of NLP tasks. However, their efficacy is undermined by undesired and inconsistent behaviors, including hallucination, unfaithful reasoning, and toxic content. A promising approach to rectify these flaws is self-correction, where the LLM itself is prompted or guided to fix problems in its own output. Techniques leveraging automated feedback -- either produced by the LLM itself or some external system -- are of particular interest as they are a promising way to make LLM-based solutions more practical and deployable with minimal human feedback. This paper presents a comprehensive review of this emerging class of techniques. We analyze and taxonomize a wide array of recent work utilizing these strategies, including training-time, generation-time, and post-hoc correction. We also summarize the major applications of this strategy and conclude by discussing future directions and challenges.

Comments:	Work in Progress. Version 2
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2308.03188 [cs.CL]
	(or arXiv:2308.03188v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2308.03188

Submission history

From: Liangming Pan [view email]
[v1] Sun, 6 Aug 2023 18:38:52 UTC (1,032 KB)
[v2] Wed, 30 Aug 2023 03:47:34 UTC (1,039 KB)

Computer Science > Computation and Language

Title:Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators