Toward High Quality Facial Representation Learning

Wang, Yue; Peng, Jinlong; Zhang, Jiangning; Yi, Ran; Liu, Liang; Wang, Yabiao; Wang, Chengjie

doi:10.1145/3581783.3611999

Computer Science > Computer Vision and Pattern Recognition

arXiv:2309.03575 (cs)

[Submitted on 7 Sep 2023]

Title:Toward High Quality Facial Representation Learning

Authors:Yue Wang, Jinlong Peng, Jiangning Zhang, Ran Yi, Liang Liu, Yabiao Wang, Chengjie Wang

View PDF

Abstract:Face analysis tasks have a wide range of applications, but the universal facial representation has only been explored in a few works. In this paper, we explore high-performance pre-training methods to boost the face analysis tasks such as face alignment and face parsing. We propose a self-supervised pre-training framework, called \textbf{\it Mask Contrastive Face (MCF)}, with mask image modeling and a contrastive strategy specially adjusted for face domain tasks. To improve the facial representation quality, we use feature map of a pre-trained visual backbone as a supervision item and use a partially pre-trained decoder for mask image modeling. To handle the face identity during the pre-training stage, we further use random masks to build contrastive learning pairs. We conduct the pre-training on the LAION-FACE-cropped dataset, a variants of LAION-FACE 20M, which contains more than 20 million face images from Internet websites. For efficiency pre-training, we explore our framework pre-training performance on a small part of LAION-FACE-cropped and verify the superiority with different pre-training settings. Our model pre-trained with the full pre-training dataset outperforms the state-of-the-art methods on multiple downstream tasks. Our model achieves 0.932 NME$_{diag}$ for AFLW-19 face alignment and 93.96 F1 score for LaPa face parsing. Code is available at this https URL.

Comments:	ACM MM 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2309.03575 [cs.CV]
	(or arXiv:2309.03575v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2309.03575
Related DOI:	https://doi.org/10.1145/3581783.3611999

Submission history

From: Yue Wang [view email]
[v1] Thu, 7 Sep 2023 09:11:49 UTC (577 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Toward High Quality Facial Representation Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Toward High Quality Facial Representation Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators