Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing

Lin, Xun; Wang, Shuai; Cai, Rizhao; Liu, Yizhong; Fu, Ying; Yu, Zitong; Tang, Wenzhong; Kot, Alex

arXiv:2402.19298 (cs)

[Submitted on 29 Feb 2024 (v1), last revised 5 Mar 2024 (this version, v2)]

Abstract: Face Anti-Spoofing (FAS) is crucial for securing face recognition systems against presentation attacks. With advancements in sensor manufacturing and multi-modal learning techniques, many multi-modal FAS approaches have emerged. However, they face challenges in generalizing to unseen attacks and deployment conditions. These challenges arise from (1) modality unreliability, where some modality sensors, such as depth and infrared, undergo significant domain shifts in varying environments, leading to the spread of unreliable information during cross-modal feature fusion, and (2) modality imbalance, where over-reliance on a dominant modality during training hinders the convergence of the others, reducing effectiveness against attack types that are indistinguishable using the dominant modality alone. To address modality unreliability, we propose the Uncertainty-Guided Cross-Adapter (U-Adapter) to recognize unreliably detected regions within each modality and suppress the impact of unreliable regions on other modalities. For modality imbalance, we propose a Rebalanced Modality Gradient Modulation (ReGrad) strategy to rebalance the convergence speed of all modalities by adaptively adjusting their gradients. In addition, we provide the first large-scale benchmark for evaluating multi-modal FAS performance under domain generalization scenarios. Extensive experiments demonstrate that our method outperforms state-of-the-art methods. Source code and protocols will be released on this https URL.
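The two ideas named in the abstract, suppressing unreliable regions and rebalancing per-modality gradients, can be illustrated with a toy sketch. This is not the authors' U-Adapter or ReGrad: the abstract does not give their formulas, so the weighting rule `1 - uncertainty`, the loss-ratio coefficient, and every function name below are assumptions chosen only to make the general mechanism concrete.

```python
from typing import Dict, List


def suppress_unreliable(features: List[float], uncertainty: List[float]) -> List[float]:
    """Down-weight each region's feature by (1 - uncertainty).

    Assumption: uncertainty is a per-region score, clipped here to [0, 1].
    A fully uncertain region contributes nothing to the fused features.
    """
    if len(features) != len(uncertainty):
        raise ValueError("features and uncertainty must have the same length")
    return [f * (1.0 - min(max(u, 0.0), 1.0)) for f, u in zip(features, uncertainty)]


def rebalance_coefficients(losses: Dict[str, float], eps: float = 1e-8) -> Dict[str, float]:
    """Return one gradient scale per modality (hypothetical rule).

    Assumption: a modality whose loss is below the mean is treated as
    converging faster, so its gradients are scaled down; a modality with
    above-average loss is scaled up. The scales average to about 1.
    """
    mean_loss = sum(losses.values()) / len(losses)
    return {name: loss / (mean_loss + eps) for name, loss in losses.items()}


def scale_gradients(
    grads: Dict[str, List[float]], coeffs: Dict[str, float]
) -> Dict[str, List[float]]:
    """Multiply each modality's gradient vector by its coefficient."""
    return {name: [g * coeffs[name] for g in vec] for name, vec in grads.items()}


if __name__ == "__main__":
    # Region 3 is fully unreliable, so it is zeroed before fusion.
    print(suppress_unreliable([1.0, 2.0, 4.0], [0.0, 0.5, 1.5]))

    # RGB has the lowest loss here, so its gradients are damped the most.
    coeffs = rebalance_coefficients({"rgb": 0.2, "depth": 0.6, "ir": 0.4})
    print(scale_gradients({"rgb": [1.0], "depth": [1.0], "ir": [1.0]}, coeffs))
```

In a real training loop the same scaling would typically be applied to the gradients of each modality branch before the optimizer step; the plain lists here stand in for those tensors.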

Comments:	Accepted by CVPR 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2402.19298 [cs.CV]
	(or arXiv:2402.19298v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2402.19298

Submission history

From: Xun Lin
[v1] Thu, 29 Feb 2024 16:06:36 UTC (4,416 KB)
[v2] Tue, 5 Mar 2024 11:59:29 UTC (4,417 KB)