Training Naturalized Semantic Parsers with Very Little Data

Rongali, Subendhu; Arkoudas, Konstantine; Rubino, Melanie; Hamza, Wael

arXiv:2204.14243 (cs)

[Submitted on 29 Apr 2022 (v1), last revised 4 May 2022 (this version, v2)]

Abstract: Semantic parsing is an important NLP problem, particularly for voice assistants such as Alexa and Google Assistant. State-of-the-art (SOTA) semantic parsers are seq2seq architectures based on large language models that have been pretrained on vast amounts of text. To better leverage that pretraining, recent work has explored a reformulation of semantic parsing whereby the output sequences are themselves natural language sentences, but in a controlled fragment of natural language. This approach delivers strong results, particularly for few-shot semantic parsing, which is of key importance in practice and the focus of our paper. We push this line of work forward by introducing an automated methodology that delivers very significant additional improvements by utilizing modest amounts of unannotated data, which is typically easy to obtain. Our method is based on a novel synthesis of four techniques: joint training with auxiliary unsupervised tasks; constrained decoding; self-training; and paraphrasing. We show that this method delivers new SOTA few-shot performance on the Overnight dataset, particularly in very low-resource settings, and very compelling few-shot results on a new semantic parsing dataset.
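Of the four techniques named in the abstract, constrained decoding is the easiest to illustrate in isolation. The abstract does not describe how the authors implement it, so the following is only a minimal sketch of one common approach: a prefix trie over allowed output sentences that restricts greedy decoding to the controlled fragment. The names `build_trie`, `constrained_greedy_decode`, `score_fn`, and the `<eos>` marker are all hypothetical, and the scoring function stands in for a real seq2seq model.

```python
from typing import Callable, Dict, List, Sequence

END = "<eos>"  # hypothetical end-of-sentence marker


def build_trie(sentences: Sequence[Sequence[str]]) -> dict:
    """Build a nested-dict prefix trie over the allowed token sequences."""
    root: dict = {}
    for tokens in sentences:
        node = root
        for tok in list(tokens) + [END]:
            node = node.setdefault(tok, {})
    return root


def constrained_greedy_decode(
    score_fn: Callable[[List[str]], Dict[str, float]],
    trie: dict,
    max_len: int = 32,
) -> List[str]:
    """Greedy decoding that only considers tokens the trie permits next.

    score_fn is an assumed stand-in for a model: given the prefix decoded
    so far, it returns a score for each candidate next token.
    """
    prefix: List[str] = []
    node = trie
    for _ in range(max_len):
        allowed = list(node.keys())
        if not allowed:
            break
        scores = score_fn(prefix)
        # Tokens outside the trie are never candidates, however high they score.
        best = max(allowed, key=lambda t: scores.get(t, float("-inf")))
        if best == END:
            break
        prefix.append(best)
        node = node[best]
    return prefix


# Toy usage: the scorer prefers "delete", which the trie does not allow.
canonical = [
    ["list", "all", "meetings"],
    ["list", "all", "recipes"],
    ["count", "all", "meetings"],
]


def toy_scores(prefix: List[str]) -> Dict[str, float]:
    return {"delete": 5.0, "list": 2.0, "count": 1.0, "all": 1.0,
            "recipes": 3.0, "meetings": 0.5, END: 0.0}


result = constrained_greedy_decode(toy_scores, build_trie(canonical))
print(result)  # ['list', 'all', 'recipes']
```

In this sketch the highest-scoring token overall is never emitted because it falls outside the trie, so the output is always a prefix of an allowed sentence. A practical system would more likely constrain beam search over subword tokens and derive the allowed continuations from a grammar rather than an enumerated list.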

Comments:	IJCAI 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2204.14243 [cs.CL]
	(or arXiv:2204.14243v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2204.14243

Submission history

From: Subendhu Rongali
[v1] Fri, 29 Apr 2022 17:14:54 UTC (99 KB)
[v2] Wed, 4 May 2022 17:52:49 UTC (99 KB)
