Exploring Demonstration Ensembling for In-context Learning

Khalifa, Muhammad; Logeswaran, Lajanugen; Lee, Moontae; Lee, Honglak; Wang, Lu

Computer Science > Computation and Language

arXiv:2308.08780 (cs)

[Submitted on 17 Aug 2023 (v1), last revised 21 Aug 2023 (this version, v2)]

Title:Exploring Demonstration Ensembling for In-context Learning

Authors:Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Lu Wang

View PDF

Abstract:In-context learning (ICL) operates by showing language models (LMs) examples of input-output pairs for a given task, i.e., demonstrations. The standard approach for ICL is to prompt the LM with concatenated demonstrations followed by the test input. This approach suffers from some issues. First, concatenation offers almost no control over the contribution of each demo to the model prediction. This can be sub-optimal when some demonstrations are irrelevant to the test example. Second, due to the input length limit of some transformer models, it might be infeasible to fit many examples into the context, especially when dealing with long-input tasks. In this work, we explore Demonstration Ensembling (DENSE) as an alternative to simple concatenation. DENSE predicts outputs using subsets (i.e., buckets) of the demonstrations and then combines the output probabilities resulting from each subset to produce the final prediction. We study different ensembling methods using GPT-j and experiment on 12 language tasks. Our experiments show weighted max ensembling to outperform vanilla concatenation by as large as 2.4 average points. Code available at this https URL.

Comments:	Published at ME-FoMo workshop at ICLR 2023. Arxiv version includes evaluation on 5 more tasks
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2308.08780 [cs.CL]
	(or arXiv:2308.08780v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2308.08780

Submission history

From: Muhammad Khalifa [view email]
[v1] Thu, 17 Aug 2023 04:45:19 UTC (3,244 KB)
[v2] Mon, 21 Aug 2023 01:25:30 UTC (3,244 KB)

Computer Science > Computation and Language

Title:Exploring Demonstration Ensembling for In-context Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Exploring Demonstration Ensembling for In-context Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators