SAMCLR: Contrastive pre-training on complex scenes using SAM for view sampling

Missaoui, Benjamin; Yuan, Chongbin

Computer Science > Computer Vision and Pattern Recognition

arXiv:2310.14736 (cs)

[Submitted on 23 Oct 2023 (v1), last revised 29 Oct 2023 (this version, v2)]

Title:SAMCLR: Contrastive pre-training on complex scenes using SAM for view sampling

Authors:Benjamin Missaoui, Chongbin Yuan

View PDF

Abstract:In Computer Vision, self-supervised contrastive learning enforces similar representations between different views of the same image. The pre-training is most often performed on image classification datasets, like ImageNet, where images mainly contain a single class of objects. However, when dealing with complex scenes with multiple items, it becomes very unlikely for several views of the same image to represent the same object category. In this setting, we propose SAMCLR, an add-on to SimCLR which uses SAM to segment the image into semantic regions, then sample the two views from the same region. Preliminary results show empirically that when pre-training on Cityscapes and ADE20K, then evaluating on classification on CIFAR-10, STL10 and ImageNette, SAMCLR performs at least on par with, and most often significantly outperforms not only SimCLR, but also DINO and MoCo.

Comments:	Accepted at NeurIPS 2023 Workshop on SSL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2310.14736 [cs.CV]
	(or arXiv:2310.14736v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2310.14736

Submission history

From: Benjamin Missaoui [view email]
[v1] Mon, 23 Oct 2023 09:16:04 UTC (601 KB)
[v2] Sun, 29 Oct 2023 03:57:18 UTC (214 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:SAMCLR: Contrastive pre-training on complex scenes using SAM for view sampling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SAMCLR: Contrastive pre-training on complex scenes using SAM for view sampling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators