Which Pretrain Samples to Rehearse when Finetuning Pretrained Models?

Bai, Andrew; Yeh, Chih-Kuan; Hsieh, Cho-Jui; Taly, Ankur

arXiv:2402.08096 (cs)

[Submitted on 12 Feb 2024]

Abstract: Finetuning pretrained foundation models on specific tasks is now the de facto approach for text and vision tasks. A known pitfall of this approach is that pretraining knowledge is forgotten during finetuning. Rehearsing samples drawn at random from the pretrain dataset is a common way to alleviate such forgetting. However, we find that random mixing unintentionally includes samples that are not (yet) forgotten or that are unlearnable by the model. We propose a novel sampling scheme, mix-cd, that identifies and prioritizes samples that actually face forgetting, which we call collateral damage. Since directly identifying collateral damage samples is computationally expensive, we propose a procedure to estimate the distribution of such samples by tracking the statistics of finetuned samples. Our approach is lightweight, easy to implement, and can be seamlessly integrated into existing models, offering an effective means to retain pretrain performance without additional computational costs.
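
To make the idea in the abstract concrete, the following is a minimal illustrative sketch, not the authors' implementation. It assumes the pretrain data has been split into partitions (how to partition is an assumption here), and that for every pretrain sample already rehearsed during finetuning we know whether the model was correct before finetuning and whether it is correct now. The class name `CollateralDamageSampler`, its methods, and the smoothing prior are all hypothetical.

```python
import random


class CollateralDamageSampler:
    """Hypothetical sketch: rehearse more from partitions showing more forgetting."""

    def __init__(self, partitions, prior=1.0, seed=0):
        # partitions: dict mapping a partition id to a list of pretrain sample ids.
        self.partitions = partitions
        self.rng = random.Random(seed)
        # Smoothed counts (assumption) so every partition keeps a nonzero weight.
        self.damaged = {k: prior for k in partitions}
        self.seen = {k: 2.0 * prior for k in partitions}

    def rate(self, k):
        # Estimated fraction of partition k that is collateral damage.
        return self.damaged[k] / self.seen[k]

    def update(self, k, correct_before, correct_now):
        # Collateral damage here means: right before finetuning, wrong now.
        # The statistics come from samples that were rehearsed anyway.
        self.seen[k] += 1
        if correct_before and not correct_now:
            self.damaged[k] += 1

    def sample(self, n):
        # Weight each partition by its estimated number of damaged samples.
        keys = list(self.partitions)
        weights = [self.rate(k) * len(self.partitions[k]) for k in keys]
        chosen = self.rng.choices(keys, weights=weights, k=n)
        return [(k, self.rng.choice(self.partitions[k])) for k in chosen]


# Usage: two equally sized partitions, one of which is being forgotten.
sampler = CollateralDamageSampler({"a": list(range(10)), "b": list(range(10, 20))})
for _ in range(20):
    sampler.update("a", correct_before=True, correct_now=True)
    sampler.update("b", correct_before=True, correct_now=False)
batch = sampler.sample(8)  # mostly drawn from partition "b"
```

Because the counts are updated only from samples that are rehearsed during finetuning anyway, a scheme of this shape needs no extra forward passes over the pretrain set, which is one plausible reading of the "no additional computational costs" claim.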

Comments:	17 pages, 13 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2402.08096 [cs.LG]
	(or arXiv:2402.08096v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.08096

Submission history

From: Andrew Bai
[v1] Mon, 12 Feb 2024 22:32:12 UTC (1,845 KB)
