Improving Interpretability via Explicit Word Interaction Graph Layer

Arshdeep Sekhon; Hanjie Chen; Aman Shrivastava; Zhe Wang; Yangfeng Ji; Yanjun Qi

doi:10.1609/aaai.v37i11.26586

Improving Interpretability via Explicit Word Interaction Graph Layer

Authors

Arshdeep Sekhon University of Virginia
Hanjie Chen University of Virginia
Aman Shrivastava University of Virginia
Zhe Wang University of Virginia
Yangfeng Ji University of Virginia
Yanjun Qi University of Virginia

DOI:

https://doi.org/10.1609/aaai.v37i11.26586

Keywords:

SNLP: Interpretability & Analysis of NLP Models, ML: Graph-based Machine Learning

Abstract

Recent NLP literature has seen growing interest in improving model interpretability. Along this direction, we propose a trainable neural network layer that learns a global interaction graph between words and then selects more informative words using the learned word interactions. Our layer, we call WIGRAPH, can plug into any neural network-based NLP text classifiers right after its word embedding layer. Across multiple SOTA NLP models and various NLP datasets, we demonstrate that adding the WIGRAPH layer substantially improves NLP models' interpretability and enhances models' prediction performance at the same time.

Downloads

Published

2023-06-26

How to Cite

Sekhon, A., Chen, H., Shrivastava, A., Wang, Z., Ji, Y., & Qi, Y. (2023). Improving Interpretability via Explicit Word Interaction Graph Layer. Proceedings of the AAAI Conference on Artificial Intelligence, 37(11), 13528-13537. https://doi.org/10.1609/aaai.v37i11.26586

Download Citation

Issue

Vol. 37 No. 11: AAAI-23 Technical Tracks 11

Section

AAAI Technical Track on Speech & Natural Language Processing

Improving Interpretability via Explicit Word Interaction Graph Layer

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription