Finding and Editing Multi-Modal Neurons in Pre-Trained Transformer

Pan, Haowen; Cao, Yixin; Wang, Xiaozhi; Yang, Xun

Computer Science > Computation and Language

arXiv:2311.07470 (cs)

[Submitted on 13 Nov 2023]

Title:Finding and Editing Multi-Modal Neurons in Pre-Trained Transformer

Authors:Haowen Pan, Yixin Cao, Xiaozhi Wang, Xun Yang

View PDF

Abstract:Multi-modal large language models (LLM) have achieved powerful capabilities for visual semantic understanding in recent years. However, little is known about how LLMs comprehend visual information and interpret different modalities of features. In this paper, we propose a new method for identifying multi-modal neurons in transformer-based multi-modal LLMs. Through a series of experiments, We highlight three critical properties of multi-modal neurons by four well-designed quantitative evaluation metrics. Furthermore, we introduce a knowledge editing method based on the identified multi-modal neurons, for modifying a specific token to another designative token. We hope our findings can inspire further explanatory researches on understanding mechanisms of multi-modal LLMs.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2311.07470 [cs.CL]
	(or arXiv:2311.07470v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2311.07470

Submission history

From: Haowen Pan [view email]
[v1] Mon, 13 Nov 2023 17:03:02 UTC (7,365 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2311

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:Finding and Editing Multi-Modal Neurons in Pre-Trained Transformer

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Finding and Editing Multi-Modal Neurons in Pre-Trained Transformer

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators