Modular and Parameter-Efficient Multimodal Fusion with Prompting

Liang, Sheng; Zhao, Mengjie; Schütze, Hinrich

Computer Science > Computation and Language

arXiv:2203.08055 (cs)

[Submitted on 15 Mar 2022]

Title:Modular and Parameter-Efficient Multimodal Fusion with Prompting

Authors:Sheng Liang, Mengjie Zhao, Hinrich Schütze

View PDF

Abstract:Recent research has made impressive progress in large-scale multimodal pre-training. In the context of the rapid growth of model size, it is necessary to seek efficient and flexible methods other than finetuning. In this paper, we propose to use prompt vectors to align the modalities. Our method achieves comparable performance to several other multimodal fusion methods in low-resource settings. We further show that our method is modular and parameter-efficient for processing tasks involving two or more data modalities.

Comments:	Accepted to Findings of ACL 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2203.08055 [cs.CL]
	(or arXiv:2203.08055v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2203.08055

Submission history

From: Sheng Liang [view email]
[v1] Tue, 15 Mar 2022 16:50:15 UTC (1,601 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2203

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:Modular and Parameter-Efficient Multimodal Fusion with Prompting

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Modular and Parameter-Efficient Multimodal Fusion with Prompting

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators