K-VQG: Knowledge-aware Visual Question Generation for Common-sense Acquisition

Uehara, Kohei; Harada, Tatsuya

Computer Science > Computer Vision and Pattern Recognition

arXiv:2203.07890 (cs)

[Submitted on 15 Mar 2022]

Title:K-VQG: Knowledge-aware Visual Question Generation for Common-sense Acquisition

Authors:Kohei Uehara, Tatsuya Harada

View PDF

Abstract:Visual Question Generation (VQG) is a task to generate questions from images. When humans ask questions about an image, their goal is often to acquire some new knowledge. However, existing studies on VQG have mainly addressed question generation from answers or question categories, overlooking the objectives of knowledge acquisition. To introduce a knowledge acquisition perspective into VQG, we constructed a novel knowledge-aware VQG dataset called K-VQG. This is the first large, humanly annotated dataset in which questions regarding images are tied to structured knowledge. We also developed a new VQG model that can encode and use knowledge as the target for a question. The experiment results show that our model outperforms existing models on the K-VQG dataset.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
Cite as:	arXiv:2203.07890 [cs.CV]
	(or arXiv:2203.07890v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2203.07890

Submission history

From: Kohei Uehara [view email]
[v1] Tue, 15 Mar 2022 13:38:10 UTC (10,410 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2203

Change to browse by:

cs
cs.CL

References & Citations

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:K-VQG: Knowledge-aware Visual Question Generation for Common-sense Acquisition

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:K-VQG: Knowledge-aware Visual Question Generation for Common-sense Acquisition

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators