TILFA: A Unified Framework for Text, Image, and Layout Fusion in Argument Mining

Zong, Qing; Wang, Zhaowei; Xu, Baixuan; Zheng, Tianshi; Shi, Haochen; Wang, Weiqi; Song, Yangqiu; Wong, Ginny Y.; See, Simon

arXiv:2310.05210 (cs)

[Submitted on 8 Oct 2023]

Abstract: A main goal of Argument Mining (AM) is to analyze an author's stance. Unlike previous AM datasets, which focus only on text, the shared task at the 10th Workshop on Argument Mining introduces a dataset that includes both text and images. Importantly, these images contain both visual elements and optical characters. Our new framework, TILFA (A Unified Framework for Text, Image, and Layout Fusion in Argument Mining), is designed to handle this mixed data. It excels not only at understanding text but also at detecting optical characters and recognizing layout details in images. Our model significantly outperforms existing baselines, earning our team, KnowComp, 1st place on the leaderboard of the Argumentative Stance Classification subtask of this shared task.

Comments:	Accepted to the 10th Workshop on Argument Mining, co-located with EMNLP 2023
Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2310.05210 [cs.AI]
	(or arXiv:2310.05210v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2310.05210

Submission history

From: Qing Zong
[v1] Sun, 8 Oct 2023 15:54:37 UTC (1,469 KB)
