Catastrophic Forgetting in the Context of Model Updates

Harang, Rich; Sanders, Hillary

arXiv:2306.10181 (cs)

[Submitted on 16 Jun 2023]

Abstract: A major obstacle to deploying deep learning models in practice is updating them after deployment (ideally, frequently). Deep neural networks can cost many thousands of dollars to train. When new data comes in, you have two options: train a new model from scratch (randomly initialized weights) on all existing data, or take an existing model and fine-tune it (continue training) on the new data. The former is costly and slow. The latter is cheap and fast, but catastrophic forgetting generally causes the new model to 'forget' how to classify older data well. There is a plethora of complicated techniques for keeping models from forgetting what they have learned. Arguably the most basic is to mix a small amount of past data into the new data during fine-tuning, also known as 'data rehearsal'. In this paper, we compare various methods of limiting catastrophic forgetting and conclude that if you can maintain access to a portion of your past data (or tasks), data rehearsal is ideal in terms of overall accuracy across all time periods, and performs even better when combined with methods like Elastic Weight Consolidation (EWC). Especially when the amount of past data (past 'tasks') is large compared to new data, updating an existing model is far cheaper and faster than training a new model from scratch.
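
The two techniques named in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the function names, the `rehearsal_fraction` and `lam` parameters, and the choice to size the replay sample relative to the new data are all assumptions made for illustration. The EWC term uses the commonly cited diagonal-Fisher quadratic penalty, which may differ in detail from what the paper's experiments use.

```python
import random


def build_rehearsal_set(new_data, past_data, rehearsal_fraction=0.1, seed=0):
    """Mix a small random sample of past examples into the new data.

    rehearsal_fraction (hypothetical parameter) is the size of the replayed
    sample relative to len(new_data), capped at the amount of past data.
    """
    rng = random.Random(seed)
    n_replay = min(len(past_data), int(round(rehearsal_fraction * len(new_data))))
    replay = rng.sample(list(past_data), n_replay)
    mixed = list(new_data) + replay
    rng.shuffle(mixed)  # interleave old and new examples for fine-tuning
    return mixed


def ewc_penalty(params, anchor_params, fisher_diag, lam=1.0):
    """Quadratic EWC-style penalty: (lam / 2) * sum_i F_i * (theta_i - theta*_i)^2.

    params        -- current weights (flat list of floats)
    anchor_params -- weights of the previously deployed model
    fisher_diag   -- per-weight importance estimates (diagonal Fisher)
    lam           -- penalty strength (hypothetical default)
    """
    return 0.5 * lam * sum(
        f * (p - a) ** 2 for p, a, f in zip(params, anchor_params, fisher_diag)
    )


if __name__ == "__main__":
    new = [("new", i) for i in range(100)]
    past = [("old", i) for i in range(1000)]
    mixed = build_rehearsal_set(new, past, rehearsal_fraction=0.1)
    print(len(mixed), sum(1 for tag, _ in mixed if tag == "old"))
    print(ewc_penalty([1.0, 2.0], [0.0, 2.0], [2.0, 5.0], lam=3.0))
```

Under these assumptions, fine-tuning would run on the mixed set, and the EWC penalty would be added to the task loss when the two methods are combined.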

Comments:	We wrote this in 2021, though didn't get around to putting it up. State of the art has improved a bit since then, but the experiments I think are still interesting and relevant
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2306.10181 [cs.LG]
	(or arXiv:2306.10181v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.10181

Submission history

From: Hillary Sanders [view email]
[v1] Fri, 16 Jun 2023 21:21:41 UTC (2,207 KB)
