Intra-variance Guided Metric Learning for Face Forgery Detection

Chen, Zhentao; Hu, Junlin

doi:10.1007/978-981-99-8565-4_14

Zhentao Chen¹⁵ &
Junlin Hu¹⁵

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14463))

Included in the following conference series:

Chinese Conference on Biometric Recognition

Abstract

Since facial manipulation technology has raised serious concerns, facial forgery detection has also attracted increasing attention. Although recent work has made good achievements, the detection of unseen fake faces is still a big challenge. In this paper, we tackle facial forgery detection problem from the perspective of distance metric learning, and design a new Intra-Variance guided Metric Learning (IVML) method to drive classification and adopt Vision Transformer (ViT) as the backbone, which aims to improve the generalization ability of face forgery detection methods. Specifically, considering that there is a large gap between different real faces, our proposed IVML method increases the distance between real and fake faces while maintaining a certain distance within real faces. We choose ViT as the backbone as our experiments prove that ViT has better generalization ability in face forgery detection. A large number of experiments demonstrate the effectiveness and superiority of our IVML method in cross-dataset evaluation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 79.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Goodfellow, I.J., et al.: Generative adversarial networks. Commun. ACM 63, 139–144 (2020)
Article Google Scholar
Kingma, D.P., Welling, M.: Auto-encoding variational Bayes. In: International Conference on Learning Representations (2014)
Google Scholar
Afchar, D., Nozick, V., Yamagishi, J., Echizen, I.: MesoNet: a compact facial video forgery detection network. In: IEEE International Workshop on Information Forensics and Security, pp. 1–7 (2018)
Google Scholar
Rössler, A., Cozzolino, D., Verdoliva, L., Riess, C., Thies, J., Nießner, M.: Faceforensics++: learning to detect manipulated facial images. In: International Conference on Computer Vision, pp. 1–11 (2019)
Google Scholar
Dang, H., Liu, F., Stehouwer, J., Liu, X., Jain, A.K.: On the detection of digital face manipulation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 5780–5789 (2020)
Google Scholar
Li, L., et al.: Face x-ray for more general face forgery detection. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 5000–5009 (2020)
Google Scholar
Wang, C., Deng, W.: Representative forgery mining for fake face detection. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 14923–14932 (2021)
Google Scholar
Gu, Q., Chen, S., Yao, T., Chen, Y., Ding, S., Yi, R.: Exploiting fine-grained face forgery clues via progressive enhancement learning. In: AAAI Conference on Artificial Intelligence, pp. 735–743 (2022)
Google Scholar
Zhou, P., Han, X., Morariu, V.I., Davis, L.S.: Two-stream neural networks for tampered face detection. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 1831–1839 (2017)
Google Scholar
Chen, S., Yao, T., Chen, Y., Ding, S., Li, J., Ji, R.: Local relation learning for face forgery detection. In: AAAI Conference on Artificial Intelligence, pp. 1081–1088 (2021)
Google Scholar
Zhao, H., Zhou, W., Chen, D., Wei, T., Zhang, W., Yu, N.: Multi-attentional deepfake detection. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2185–2194 (2021)
Google Scholar
Li, J., Xie, H., Li, J., Wang, Z., Zhang, Y.: Frequency-aware discriminative feature learning supervised by single-center loss for face forgery detection. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 6458–6467 (2021)
Google Scholar
Nguyen, H.H., Yamagishi, J., Echizen, I.: Capsule-forensics: Using capsule networks to detect forged images and videos. In: International Conference on Acoustics, Speech and Signal Processing, pp. 2307–2311 (2019)
Google Scholar
Chollet, F.: Xception: deep learning with depthwise separable convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1800–1807 (2017)
Google Scholar
Cao, J., Ma, C., Yao, T., Chen, S., Ding, S., Yang, X.: End-to-end reconstruction-classification learning for face forgery detection. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4103–4112 (2022)
Google Scholar
Huang, B., et al.: Implicit identity driven deepfake face swapping detection. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4490–4499 (2023)
Google Scholar
Wen, Y., Zhang, K., Li, Z., Qiao, Y.: A discriminative feature learning approach for deep face recognition. In: European Conference on Computer Vision, pp. 499–515 (2016)
Google Scholar
Schroff, F., Kalenichenko, D., Philbin, J.: FaceNet: a unified embedding for face recognition and clustering. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 815–823 (2015)
Google Scholar
Kumar, A., Bhavsar, A., Verma, R.: Detecting deepfakes with metric learning. In: International Workshop on Biometrics and Forensics, pp. 1–6 (2020)
Google Scholar
Dosovitskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. In: International Conference on Learning Representations (2021)
Google Scholar
Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017)
Google Scholar
Steiner, A., Kolesnikov, A., Zhai, X., Wightman, R., Uszkoreit, J., Beyer, L.: How to train your vit? data, augmentation, and regularization in vision transformers. Trans. Mach. Learn. Res. 2022 (2022)
Google Scholar
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255 (2009)
Google Scholar
Li, Y., Yang, X., Sun, P., Qi, H., Lyu, S.: Celeb-DF: a large-scale challenging dataset for deepfake forensics. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3204–3213 (2020)
Google Scholar
Thies, J., Zollhöfer, M., Stamminger, M., Theobalt, C., Nießner, M.: Face2face: real-time face capture and reenactment of RGB videos. Commun. ACM 62(1), 96–104 (2019)
Article Google Scholar
Thies, J., Zollhöfer, M., Nießner, M.: Deferred neural rendering: image synthesis using neural textures. ACM Trans. Graph. 38(4) 66:1–66:12 (2019)
Google Scholar
Paszke, A., et al.: Automatic differentiation in pytorch. In: Advances in Neural Information Processing Systems Workshop (2017)
Google Scholar
Tan, M., Le, Q.V.: Efficientnet: rethinking model scaling for convolutional neural networks. In: International Conference on Machine Learning, pp. 6105–6114 (2019)
Google Scholar
Li, D., Yang, Y., Song, Y., Hospedales, T.M.: Learning to generalize: meta-learning for domain generalization. In: AAAI Conference on Artificial Intelligence, pp. 3490–3497 (2018)
Google Scholar
Luo, Y., Zhang, Y., Yan, J., Liu, W.: Generalizing face forgery detection with high-frequency features. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 16317–16326 (2021)
Google Scholar

Download references

Acknowledgments

This work was supported by the National Natural Science Foundation of China under Grant 62006013.

Author information

Authors and Affiliations

School of Software, Beihang University, Beijing, China
Zhentao Chen & Junlin Hu

Authors

Zhentao Chen
View author publications
You can also search for this author in PubMed Google Scholar
Junlin Hu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Junlin Hu .

Editor information

Editors and Affiliations

Hefei University of Technology, Hefei, China
Wei Jia
South China University of Technology, Guangzhou, China
Wenxiong Kang
China University of Mining and Technology, Xuzhou, China
Zaiyu Pan
Shandong University, Jinan, China
Xianye Ben
China University of Mining and Technology, Xuzhou, China
Zhengfu Bian
Southern University of Science and Technology, Shenzhen, China
Shiqi Yu
Chinese Academy of Sciences, Beijing, China
Zhaofeng He
China University of Mining and Technology, Xuzhou, China
Jun Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, Z., Hu, J. (2023). Intra-variance Guided Metric Learning for Face Forgery Detection. In: Jia, W., et al. Biometric Recognition. CCBR 2023. Lecture Notes in Computer Science, vol 14463. Springer, Singapore. https://doi.org/10.1007/978-981-99-8565-4_14

Download citation

DOI: https://doi.org/10.1007/978-981-99-8565-4_14
Published: 02 December 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8564-7
Online ISBN: 978-981-99-8565-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Intra-variance Guided Metric Learning for Face Forgery Detection