SwiftLearn: A Data-Efficient Training Method of Deep Learning Models using Importance Sampling

Hajimolahoseini, Habib; Awad, Omar Mohamed; Ahmed, Walid; Wen, Austin; Asani, Saina; Hassanpour, Mohammad; Javadi, Farnoosh; Ahmadi, Mehdi; Ataiefard, Foozhan; Liu, Kangling; Liu, Yang

Computer Science > Machine Learning

arXiv:2311.15134 (cs)

[Submitted on 25 Nov 2023]

Title:SwiftLearn: A Data-Efficient Training Method of Deep Learning Models using Importance Sampling

Authors:Habib Hajimolahoseini, Omar Mohamed Awad, Walid Ahmed, Austin Wen, Saina Asani, Mohammad Hassanpour, Farnoosh Javadi, Mehdi Ahmadi, Foozhan Ataiefard, Kangling Liu, Yang Liu

View PDF

Abstract:In this paper, we present SwiftLearn, a data-efficient approach to accelerate training of deep learning models using a subset of data samples selected during the warm-up stages of training. This subset is selected based on an importance criteria measured over the entire dataset during warm-up stages, aiming to preserve the model performance with fewer examples during the rest of training. The importance measure we propose could be updated during training every once in a while, to make sure that all of the data samples have a chance to return to the training loop if they show a higher importance. The model architecture is unchanged but since the number of data samples controls the number of forward and backward passes during training, we can reduce the training time by reducing the number of training samples used in each epoch of training. Experimental results on a variety of CV and NLP models during both pretraining and finetuning show that the model performance could be preserved while achieving a significant speed-up during training. More specifically, BERT finetuning on GLUE benchmark shows that almost 90% of the data can be dropped achieving an end-to-end average speedup of 3.36x while keeping the average accuracy drop less than 0.92%.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2311.15134 [cs.LG]
	(or arXiv:2311.15134v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.15134

Submission history

From: Omar Mohamed Awad [view email]
[v1] Sat, 25 Nov 2023 22:51:01 UTC (1,516 KB)

Computer Science > Machine Learning

Title:SwiftLearn: A Data-Efficient Training Method of Deep Learning Models using Importance Sampling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:SwiftLearn: A Data-Efficient Training Method of Deep Learning Models using Importance Sampling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators