Less is More: Selective Layer Finetuning with SubTuning

Kaplun, Gal; Gurevich, Andrey; Swisa, Tal; David, Mazor; Shalev-Shwartz, Shai; Malach, Eran

arXiv:2302.06354 (cs)

[Submitted on 13 Feb 2023 (v1), last revised 2 Jul 2023 (this version, v3)]

Authors:Gal Kaplun, Andrey Gurevich, Tal Swisa, Mazor David, Shai Shalev-Shwartz, Eran Malach

Abstract: Finetuning a pretrained model has become a standard approach for training neural networks on novel tasks, resulting in fast convergence and improved performance. In this work, we study an alternative finetuning method, where instead of finetuning all the weights of the network, we only train a carefully chosen subset of layers, keeping the rest of the weights frozen at their initial (pretrained) values. We demonstrate that subset finetuning (or SubTuning) often achieves accuracy comparable to full finetuning of the model, and even surpasses the performance of full finetuning when training data is scarce. Therefore, SubTuning allows deploying new tasks at minimal computational cost, while enjoying the benefits of finetuning the entire model. This yields a simple and effective method for multi-task learning, where different tasks do not interfere with one another, and yet share most of the resources at inference time. We demonstrate the efficiency of SubTuning across multiple tasks, using different network architectures and pretraining methods.
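The core idea in the abstract, updating only a chosen subset of layers while the rest stay frozen at their pretrained values, can be illustrated with a toy sketch. This is not the authors' implementation: the scalar "layers", the function names `forward` and `subtune`, the learning rate, and the data are all assumptions made for illustration, and the sketch does not address how the subset of layers is chosen.

```python
# Toy sketch of subset finetuning (hypothetical names, not the paper's code).
# The "network" is a chain of scalar linear layers: y = w[n-1] * ... * w[0] * x.

def forward(weights, x):
    """Return the activations after each layer (acts[0] is the input)."""
    acts = [x]
    for w in weights:
        acts.append(w * acts[-1])
    return acts

def subtune(pretrained, trainable, data, lr=0.01, epochs=200):
    """Run SGD, updating only the layers whose indices are in `trainable`."""
    weights = list(pretrained)  # copy, so the pretrained weights stay intact
    for _ in range(epochs):
        for x, target in data:
            acts = forward(weights, x)
            # Gradient of the loss 0.5 * (y - target)**2 w.r.t. the output y.
            grad_out = acts[-1] - target
            # Backpropagate through the chain, from the last layer to the first.
            for i in reversed(range(len(weights))):
                grad_w = grad_out * acts[i]       # gradient w.r.t. weights[i]
                grad_out = grad_out * weights[i]  # gradient w.r.t. layer i's input
                if i in trainable:                # frozen layers are never updated
                    weights[i] -= lr * grad_w
    return weights

# Assumed setup: three "pretrained" layers, new task y = 2x, train layer 1 only.
pretrained = [1.0, 1.0, 1.0]
data = [(1.0, 2.0), (2.0, 4.0)]
tuned = subtune(pretrained, trainable={1}, data=data)
```

In this sketch the frozen layers still take part in the forward and backward passes; only their updates are skipped, so several tasks could share those weights at inference time.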

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2302.06354 [cs.LG]
	(or arXiv:2302.06354v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2302.06354

Submission history

From: Gal Kaplun
[v1] Mon, 13 Feb 2023 13:38:46 UTC (3,272 KB)
[v2] Tue, 14 Feb 2023 02:03:11 UTC (3,272 KB)
[v3] Sun, 2 Jul 2023 12:28:46 UTC (4,163 KB)
