SalKG: Learning From Knowledge Graph Explanations for Commonsense Reasoning

Chan, Aaron; Xu, Jiashu; Long, Boyuan; Sanyal, Soumya; Gupta, Tanishq; Ren, Xiang

arXiv:2104.08793 (cs)

[Submitted on 18 Apr 2021 (v1), last revised 20 Mar 2022 (this version, v5)]

Abstract: Augmenting pre-trained language models with knowledge graphs (KGs) has achieved success on various commonsense reasoning tasks. However, for a given task instance, the KG, or certain parts of the KG, may not be useful. Although KG-augmented models often use attention to focus on specific KG components, the KG is still always used, and the attention mechanism is never explicitly taught which KG components should be used. Meanwhile, saliency methods can measure how much a KG feature (e.g., graph, node, path) influences the model to make the correct prediction, thus explaining which KG features are useful. This paper explores how saliency explanations can be used to improve KG-augmented models' performance. First, we propose to create coarse (Is the KG useful?) and fine (Which nodes/paths in the KG are useful?) saliency explanations. Second, to motivate saliency-based supervision, we analyze oracle KG-augmented models which directly use saliency explanations as extra inputs for guiding their attention. Third, we propose SalKG, a framework for KG-augmented models to learn from coarse and/or fine saliency explanations. Given saliency explanations created from a task's training set, SalKG jointly trains the model to predict the explanations, then solve the task by attending to KG features highlighted by the predicted explanations. On three commonsense QA benchmarks (CSQA, OBQA, CODAH) and a range of KG-augmented models, we show that SalKG can yield considerable performance gains -- up to 2.76% absolute improvement on CSQA.
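To make the coarse/fine distinction concrete, here is a minimal sketch of how such explanations might be computed. It is an illustration under assumptions, not the method from the paper: the abstract does not say which saliency method is used, so the sketch substitutes a simple ablation (occlusion) measure. The function names `coarse_saliency`, `fine_saliency`, and `toy_score_fn`, the `threshold` parameter, and the toy path strings are all hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def coarse_saliency(logits_with_kg, logits_without_kg, correct_idx, threshold=0.0):
    """Coarse explanation (assumed form): is the KG useful for this instance?

    The score is the gain in correct-answer probability when the KG is used;
    the binary label is 1 if that gain exceeds `threshold`.
    """
    p_kg = softmax(logits_with_kg)[correct_idx]
    p_no_kg = softmax(logits_without_kg)[correct_idx]
    score = p_kg - p_no_kg
    return score, int(score > threshold)


def fine_saliency(score_fn, units, correct_idx):
    """Fine explanation (assumed form): which nodes/paths are useful?

    Occlusion-style: each unit's score is the drop in correct-answer
    probability when that unit alone is removed from the KG input.
    """
    full = softmax(score_fn(units))[correct_idx]
    scores = {}
    for unit in units:
        ablated = [u for u in units if u != unit]
        scores[unit] = full - softmax(score_fn(ablated))[correct_idx]
    return scores


# Hypothetical stand-in for a KG-augmented model with two answer choices:
# each path contributes a fixed amount of evidence to each choice.
PATH_EVIDENCE = {
    "bird--CapableOf--fly": (2.0, 0.0),
    "bird--IsA--animal": (0.0, 0.0),
}


def toy_score_fn(units):
    """Sum per-path evidence into one logit per answer choice."""
    logits = [0.0, 0.0]
    for unit in units:
        for i, value in enumerate(PATH_EVIDENCE[unit]):
            logits[i] += value
    return logits


if __name__ == "__main__":
    paths = list(PATH_EVIDENCE)
    print(coarse_saliency(toy_score_fn(paths), toy_score_fn([]), correct_idx=0))
    print(fine_saliency(toy_score_fn, paths, correct_idx=0))
```

In this toy setup the first path carries all the evidence for the correct answer, so it receives a positive fine score while the second path scores zero. Targets of this kind are what a model could then be trained to predict alongside the task itself.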

Comments:	NeurIPS 2021
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2104.08793 [cs.CL]
	(or arXiv:2104.08793v5 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2104.08793

Submission history

From: Aaron Chan
[v1] Sun, 18 Apr 2021 09:59:46 UTC (549 KB)
[v2] Mon, 12 Jul 2021 18:53:44 UTC (765 KB)
[v3] Tue, 7 Dec 2021 20:00:29 UTC (1,214 KB)
[v4] Sat, 15 Jan 2022 06:04:57 UTC (1,214 KB)
[v5] Sun, 20 Mar 2022 04:02:52 UTC (1,214 KB)
