ABSTRACT
This paper investigates a new attack method to gait recognition systems. Different from typical spoofing attacks that require impostors to mimic certain clothing or walking styles, it proposes to intercept the video stream captured by the on-site camera and replace it with synthesized samples. To this end, we present a novel Generative Adversarial Network (GAN) based approach, which is able to render a faked video from the source walking sequence of a specified subject and the target scene image with both good visual effects and sufficient discriminative details. A new generator architecture is built, where the features of the source foreground sequence and the target background image are combined at multiple scales, making the synthesized video vivid. To fool recognition systems, the silhouette-conditioned losses are specially designed to constrain the static and dynamic consistency between the subjects in the source and generated videos. The person re-identification similarity based triplet loss is exploited to guide the generator, which keeps the personalized appearance properties stable. The edge and flow-related losses further regulate the generation of the attacking video. Two state-of-the-art gait recognition systems are used for evaluation, namely GaitSet and CNN-Gait, and we analyze their performance under attacking. Both the visual fidelity and attacking ability of the generated videos validate the effectiveness of the proposed method.
- H Abdenour, G Mohammad, et al. 2012. Can Gait Biometrics be Spoofed. In International Conference on Pattern Recognition. 3280--3283.Google Scholar
- Battista Biggio, Zahid Akhtar, Giorgio Fumera, Gian Luca Marcialis, and Fabio Roli. 2012. Security Evaluation of Biometric Authentication Systems under Real Spoofing Attacks. IET biometrics 1, 1 (2012), 11--24.Google Scholar
- Caroline Chan, Shiry Ginosar, Tinghui Zhou, and Alexei A Efros. 2018. Everybody Dance Now. arXiv preprint arXiv:1808.07371 (2018).Google Scholar
- Hanqing Chao, Yiwei He, Junping Zhang, and Jianfeng Feng. 2019. GaitSet: Regarding Gait as a Set for Cross-View Gait Recognition. In AAAI Conference on Artificial Intelligence. 8126--8133.Google ScholarCross Ref
- David Cunado, Mark S Nixon, and John N Carter. 1997. Using Gait as a Biometric, via Phase-weighted Magnitude Spectra. In International Conference on Audio-and Video-Based Biometric Person Authentication. 93--102.Google ScholarCross Ref
- Davrondzhon Gafurov, Einar Snekkenes, and Patrick Bours. 2007. Spoof Attacks on Gait Authentication System. IEEE Transactions on Information Forensics and Security 2, 3 (2007), 491--502.Google ScholarDigital Library
- Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative Adversarial Nets. In Advances in Neural Information Processing Systems. 2672--2680.Google Scholar
- Abdenour Hadid, Mohammad Ghahramani, Vili Kellokumpu, Xiaoyi Feng, John Bustard, and Mark Nixon. 2015. Gait Biometrics under Spoofing Attacks: An Experimental Investigation. Journal of Electronic Imaging 24, 6 (2015), 063022.Google ScholarCross Ref
- Ju Han and Bir Bhanu. 2006. Individual Recognition using Gait Energy Image. IEEE Transactions on Pattern Analysis and Machine Intelligence 28, 2 (2006), 316-- 322.Google ScholarDigital Library
- Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. Mask R-CNN. In IEEE International Conference on Computer Vision. 2961--2969.Google Scholar
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Pattern Recognition. 770--778.Google Scholar
- Yiwei He, Junping Zhang, Hongming Shan, and Liang Wang. 2019. Multi-task GANs for View-specific Feature Learning in Gait Recognition. IEEE Transactions on Information Forensics and Security 14, 1 (2019), 102--113.Google ScholarCross Ref
- Maodi Hu, YunhongWang, Zhaoxiang Zhang, James J Little, and Di Huang. 2013. View-invariant Discriminative Projection for Multi-view Gait-based Human Identification. IEEE Transactions on Information Forensics and Security 8, 12 (2013), 2034--2045.Google ScholarDigital Library
- Eddy Ilg, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, Alexey Dosovitskiy, and Thomas Brox. 2017. Flownet 2.0: Evolution of Optical Flow Estimation with Deep Networks. In IEEE Conference on Computer Vision and Pattern Recognition. 2462--2470.Google ScholarCross Ref
- Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, and Alexei A Efros. 2017. Image-to- Image Translation with Conditional Adversarial Networks. In IEEE Conference on Computer Vision and Pattern Recognition. 5967--5976.Google Scholar
- Mehran Khodabandeh, Hamid Reza Vaezi Joze, Ilya Zharkov, and Vivek Pradeep. 2018. DIY Human Action Dataset Generation. In IEEE Conference on Computer Vision and Pattern Recognition Workshops. 1448--1458.Google Scholar
- Lily Lee and W Eric L Grimson. 2002. Gait Analysis for Recognition and Classification. In IEEE International Conference on Automatic Face Gesture Recognition. IEEE, 155--162.Google Scholar
- Yaling Liang, Chang-Tsun Li, Yu Guan, and Yongjian Hu. 2016. Gait Recognition based on the Golden Ratio. EURASIP Journal on Image and Video Processing 25, 12 (2016), 22.Google ScholarCross Ref
- Rijun Liao, Chunshui Cao, Edel B Garcia, Shiqi Yu, and Yongzhen Huang. 2017. Pose-based Temporal-Spatial Network (PTSN) for Gait Recognition with Carrying and Clothing Variations. In Chinese Conference on Biometric Recognition. 474--483.Google Scholar
- Yasushi Makihara, Ryusuke Sagawa, Yasuhiro Mukaigawa, Tomio Echigo, and Yasushi Yagi. 2006. Gait Recognition using a View Transformation Model in the Frequency Domain. In European Conference on Computer Vision. 151--163.Google ScholarDigital Library
- Takeru Miyato, Toshiki Kataoka, Masanori Koyama, and Yuichi Yoshida. 2018. Spectral Normalization for Generative Adversarial Networks. In International Conference on Learning Representations, 2018.Google Scholar
- Noriko Takemura, Yasushi Makihara, Daigo Muramatsu, Tomio Echigo, and Yasushi Yagi. 2018. Multi-view Large Population Gait Dataset and Its Performance Evaluation for Cross-view Gait Recognition. IPSJ Transactions on Computer Vision and Applications 10, 1 (2018), 4.Google ScholarCross Ref
- Chen Wang, Junping Zhang, Liang Wang, Jian Pu, and Xiaoru Yuan. 2012. Human Identification using Temporal Information Preserving Gait Template. IEEE Transactions on Pattern Analysis and Machine Intelligence 8, 12 (2012), 2164--2176.Google ScholarDigital Library
- Liang Wang, Tieniu Tan, Huazhong Ning, and Weiming Hu. 2003. Silhouette Analysis-based Gait Recognition for Human Identification. IEEE Transactions on Information Forensics and Security 25, 12 (2003), 1505--1518.Google Scholar
- Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Guilin Liu, Andrew Tao, Jan Kautz, and Bryan Catanzaro. 2018. Video-to-Video Synthesis. In Advances in Neural Information Processing Systems. 1144--1156.Google Scholar
- Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, and Bryan Catanzaro. 2018. High-resolution Image Synthesis and Semantic Manipulation with Conditional GANs. In IEEE Conference on Computer Vision and Pattern Recognition. 8798--8807.Google Scholar
- Thomas Wolf, Mohammadreza Babaee, and Gerhard Rigoll. 2016. Multi-view Gait Recognition using 3D Convolutional Neural Networks. In IEEE International Conference on Image Processing. 4165--4169.Google ScholarCross Ref
- Zifeng Wu, Yongzhen Huang, Liang Wang, Xiaogang Wang, and Tieniu Tan. 2017. A Comprehensive Study on Cross-view Gait based Human Identification with Deep CNNs. IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 2 (2017), 209--226.Google ScholarDigital Library
- Zhizheng Wu and Haizhou Li. 2013. Voice Conversion and Spoofing Attack on Speaker Verification Systems. In Asia-Pacific Signal and Information Processing Association Annual Summit and Conference. 1--9.Google Scholar
- ChewYean Yam, Mark S Nixon, and John N Carter. 2004. Automated Person Recognition by Walking and Running via Model-based Approaches. Pattern recognition 37, 5 (2004), 1057--1072.Google Scholar
- Hongyu Yang, Di Huang, Yunhong Wang, and Anil K. Jain. 2018. Learning Face Age Progression: A Pyramid Architecture of GANs. In IEEE Conference on Computer Vision and Pattern Recognition. 31--39.Google Scholar
- Jang-Hee Yoo, Mark S Nixon, and Chris J Harris. 2002. Extracting Gait Signatures based on Anatomical Knowledge. In BMVA Symposium on Advancing Biometric Technologies. 596--606.Google Scholar
- Shiqi Yu, Haifeng Chen, Garcia Reyes, B Edel, and Norman Poh. 2017. Gaitgan: Invariant Gait Feature Extraction using Generative Adversarial Networks. In IEEE Conference on Computer Vision and Pattern Recognition Workshops. 30--37.Google ScholarCross Ref
- Shiqi Yu, Daoliang Tan, and Tieniu Tan. 2006. A Framework for Evaluating the Effect of View Angle, Clothing and Carrying Condition on Gait Recognition. In International Conference on Pattern Recognition. 441--444.Google Scholar
- Shuai Zheng, Junge Zhang, Kaiqi Huang, Ran He, and Tieniu Tan. 2011. Robust ViewTransformation Model for Gait Recognition. In IEEE International Conference on Image Processing. 2073--2076.Google Scholar
Index Terms
- Attacking Gait Recognition Systems via Silhouette Guided GANs
Recommendations
Spoof Attacks on Gait Authentication System
Part 2Research in biometric gait recognition has increased. Earlier gait recognition works reported promising results, usually with a small sample size. Recent studies with a larger sample size confirm gait potential as a biometric from which individuals can ...
A probabilistic image-weighting scheme for robust silhouette-based gait recognition
Many gait recognition methods use silhouettes as a feature due to their simplicity and effectiveness. However, silhouette-based gait recognition algorithms have the drawback of performance degradation when the silhouette images are corrupted. To solve ...
Biometric recognition by gait: A survey of modalities and features
Highlights- A comprehensive survey of biometric gait recognition based on vision, underfoot pressure, accelerometry, and audio sensory modalities.
AbstractThe scientific literature on automated gait analysis for human recognition has grown dramatically over the past 15 years. A number of sensing modalities including those based on vision, sound, pressure, and accelerometry have been used ...
Comments