Preventing Catastrophic Forgetting in Continual Learning of New Natural Language Tasks

Kar, Sudipta; Castellucci, Giuseppe; Filice, Simone; Malmasi, Shervin; Rokhlenko, Oleg

doi:10.1145/3534678.3539169

arXiv:2302.11074 (cs)

[Submitted on 22 Feb 2023]

Abstract: Multi-Task Learning (MTL) is widely accepted in Natural Language Processing as a standard technique for learning multiple related tasks in one model. Training an MTL model requires having the training data for all tasks available at the same time. As systems usually evolve over time (e.g., to support new functionalities), adding a new task to an existing MTL model usually requires retraining the model from scratch on all the tasks, which can be time-consuming and computationally expensive. Moreover, in some scenarios, the data used to train the original model may no longer be available, for example, due to storage or privacy concerns. In this paper, we approach the problem of incrementally expanding MTL models' capability to solve new tasks over time by distilling the knowledge of a model already trained on n tasks into a new one for solving n+1 tasks. To avoid catastrophic forgetting, we propose to exploit unlabeled data from the same distributions as the old tasks. Our experiments on publicly available benchmarks show that such a technique dramatically benefits the distillation by preserving the already acquired knowledge (i.e., preventing up to 20% performance drops on old tasks) while obtaining good performance on the incrementally added tasks. Further, we also show that our approach is beneficial in practical settings by using data from a leading voice assistant.
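The abstract does not spell out the training objective, so the following is only a minimal sketch of how distillation on unlabeled old-task data might be combined with supervised training on a new task. It assumes a temperature-scaled soft-label cross-entropy for the old tasks and a standard cross-entropy for the new task; the function names, the `temperature` parameter, and the mixing weight `alpha` are all hypothetical and are not taken from the paper.

```python
import math
from typing import List, Sequence, Tuple


def softmax(logits: Sequence[float], temperature: float = 1.0) -> List[float]:
    """Numerically stable softmax with an optional temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits: Sequence[float],
                      student_logits: Sequence[float],
                      temperature: float = 2.0) -> float:
    """Cross-entropy between the teacher's soft labels and the student's
    predictions for ONE unlabeled example of an old task.
    (Assumed loss form; the abstract does not specify it.)"""
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    return -sum(pt * math.log(ps) for pt, ps in zip(p_teacher, p_student))


def supervised_loss(student_logits: Sequence[float], gold_label: int) -> float:
    """Standard cross-entropy for ONE labeled example of the new task."""
    return -math.log(softmax(student_logits)[gold_label])


def combined_loss(old_task_pairs: List[Tuple[Sequence[float], Sequence[float]]],
                  new_task_examples: List[Tuple[Sequence[float], int]],
                  alpha: float = 0.5,
                  temperature: float = 2.0) -> float:
    """Weighted sum of the mean distillation loss (old tasks, unlabeled data)
    and the mean supervised loss (new task, labeled data).
    `alpha` is a hypothetical mixing weight."""
    kd = sum(distillation_loss(t, s, temperature)
             for t, s in old_task_pairs) / len(old_task_pairs)
    ce = sum(supervised_loss(s, y)
             for s, y in new_task_examples) / len(new_task_examples)
    return alpha * kd + (1.0 - alpha) * ce


# Toy usage: logits are made-up numbers, not model outputs.
old_pairs = [([2.0, 0.0], [1.5, 0.2]),   # (teacher logits, student logits)
             ([0.1, 1.9], [0.0, 2.1])]
new_examples = [([0.3, 1.2, -0.5], 1)]   # (student logits, gold label)
loss = combined_loss(old_pairs, new_examples)
```

In this sketch the old-task term needs only inputs, not labels, because the trained n-task model supplies the targets; that is the property that would let unlabeled data stand in for the original training sets.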

Comments:	KDD 2022
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2302.11074 [cs.CL]
	(or arXiv:2302.11074v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2302.11074
Related DOI:	https://doi.org/10.1145/3534678.3539169

Submission history

From: Sudipta Kar
[v1] Wed, 22 Feb 2023 00:18:25 UTC (204 KB)
