SAM-PARSER: Fine-tuning SAM Efficiently by Parameter Space Reconstruction

Peng, Zelin; Xu, Zhengqin; Zeng, Zhilin; Yang, Xiaokang; Shen, Wei

Computer Science > Computer Vision and Pattern Recognition

arXiv:2308.14604 (cs)

[Submitted on 28 Aug 2023 (v1), last revised 18 Dec 2023 (this version, v3)]

Title:SAM-PARSER: Fine-tuning SAM Efficiently by Parameter Space Reconstruction

Authors:Zelin Peng, Zhengqin Xu, Zhilin Zeng, Xiaokang Yang, Wei Shen

View PDF HTML (experimental)

Abstract:Segment Anything Model (SAM) has received remarkable attention as it offers a powerful and versatile solution for object segmentation in images. However, fine-tuning SAM for downstream segmentation tasks under different scenarios remains a challenge, as the varied characteristics of different scenarios naturally requires diverse model parameter spaces. Most existing fine-tuning methods attempt to bridge the gaps among different scenarios by introducing a set of new parameters to modify SAM's original parameter space. Unlike these works, in this paper, we propose fine-tuning SAM efficiently by parameter space reconstruction (SAM-PARSER), which introduce nearly zero trainable parameters during fine-tuning. In SAM-PARSER, we assume that SAM's original parameter space is relatively complete, so that its bases are able to reconstruct the parameter space of a new scenario. We obtain the bases by matrix decomposition, and fine-tuning the coefficients to reconstruct the parameter space tailored to the new scenario by an optimal linear combination of the bases. Experimental results show that SAM-PARSER exhibits superior segmentation performance across various scenarios, while reducing the number of trainable parameters by $\approx 290$ times compared with current parameter-efficient fine-tuning methods.

Comments:	Accepted by AAAI2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2308.14604 [cs.CV]
	(or arXiv:2308.14604v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.14604

Submission history

From: Zelin Peng [view email]
[v1] Mon, 28 Aug 2023 14:17:16 UTC (1,171 KB)
[v2] Thu, 31 Aug 2023 03:07:03 UTC (1,171 KB)
[v3] Mon, 18 Dec 2023 07:40:35 UTC (1,227 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:SAM-PARSER: Fine-tuning SAM Efficiently by Parameter Space Reconstruction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SAM-PARSER: Fine-tuning SAM Efficiently by Parameter Space Reconstruction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators