Comprehensive Survey of Model Compression and Speed up for Vision Transformers

Chen, Feiyang; Luo, Ziqian; Zhou, Lisang; Pan, Xueting; Jiang, Ying

arXiv:2404.10407 (cs)

[Submitted on 16 Apr 2024]

Abstract: Vision Transformers (ViT) have marked a paradigm shift in computer vision, outperforming state-of-the-art models across diverse tasks. However, their practical deployment is hampered by high computational and memory demands. This study addresses the challenge by evaluating four primary model compression techniques: quantization, low-rank approximation, knowledge distillation, and pruning. We methodically analyze and compare the efficacy of these techniques and their combinations in optimizing ViTs for resource-constrained environments. Our comprehensive experimental evaluation demonstrates that these methods facilitate a balanced compromise between model accuracy and computational efficiency, paving the way for wider application in edge computing devices.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2404.10407 [cs.CV]
	(or arXiv:2404.10407v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.10407
Journal reference:	Journal of Information, Technology and Policy (2024): 1-12

Submission history

From: Lisang Zhou
[v1] Tue, 16 Apr 2024 09:19:11 UTC (679 KB)
