Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning

Chen, Xiang; Li, Lei; Zhang, Ningyu; Liang, Xiaozhuan; Deng, Shumin; Tan, Chuanqi; Huang, Fei; Si, Luo; Chen, Huajun

Computer Science > Computation and Language

arXiv:2205.14704 (cs)

[Submitted on 29 May 2022 (v1), last revised 19 Sep 2023 (this version, v5)]

Title:Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning

Authors:Xiang Chen, Lei Li, Ningyu Zhang, Xiaozhuan Liang, Shumin Deng, Chuanqi Tan, Fei Huang, Luo Si, Huajun Chen

View PDF

Abstract:Prompt learning approaches have made waves in natural language processing by inducing better few-shot performance while they still follow a parametric-based learning paradigm; the oblivion and rote memorization problems in learning may encounter unstable generalization issues. Specifically, vanilla prompt learning may struggle to utilize atypical instances by rote during fully-supervised training or overfit shallow patterns with low-shot data. To alleviate such limitations, we develop RetroPrompt with the motivation of decoupling knowledge from memorization to help the model strike a balance between generalization and memorization. In contrast with vanilla prompt learning, RetroPrompt constructs an open-book knowledge-store from training instances and implements a retrieval mechanism during the process of input, training and inference, thus equipping the model with the ability to retrieve related contexts from the training corpus as cues for enhancement. Extensive experiments demonstrate that RetroPrompt can obtain better performance in both few-shot and zero-shot settings. Besides, we further illustrate that our proposed RetroPrompt can yield better generalization abilities with new datasets. Detailed analysis of memorization indeed reveals RetroPrompt can reduce the reliance of language models on memorization; thus, improving generalization for downstream tasks. Code is available in this https URL.

Comments:	NeurIPS 2022 (Spotlight)
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:2205.14704 [cs.CL]
	(or arXiv:2205.14704v5 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.14704

Submission history

From: Ningyu Zhang [view email]
[v1] Sun, 29 May 2022 16:07:30 UTC (1,213 KB)
[v2] Wed, 10 Aug 2022 13:24:19 UTC (1,091 KB)
[v3] Tue, 11 Oct 2022 14:06:11 UTC (1,123 KB)
[v4] Thu, 27 Jul 2023 04:07:02 UTC (1,123 KB)
[v5] Tue, 19 Sep 2023 12:33:09 UTC (1,123 KB)

Computer Science > Computation and Language

Title:Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators