research-article

Attention-guided Adversarial Attack for Video Object Segmentation

Authors:
Rui Yao

School of Computer Science and Technology, China University of Mining and Technology, Engineering Research Center of Mine Digitization, Ministry of Education of the Peoples Republic of China, China

School of Computer Science and Technology, China University of Mining and Technology, Engineering Research Center of Mine Digitization, Ministry of Education of the Peoples Republic of China, China
View Profile

,
Ying Chen

School of Computer Science and Technology, China University of Mining and Technology, Engineering Research Center of Mine Digitization, Ministry of Education of the Peoples Republic of China, China

School of Computer Science and Technology, China University of Mining and Technology, Engineering Research Center of Mine Digitization, Ministry of Education of the Peoples Republic of China, China
View Profile

,
Yong Zhou

School of Computer Science and Technology, China University of Mining and Technology, Engineering Research Center of Mine Digitization, Ministry of Education of the Peoples Republic of China, China

School of Computer Science and Technology, China University of Mining and Technology, Engineering Research Center of Mine Digitization, Ministry of Education of the Peoples Republic of China, China
View Profile

,
Fuyuan Hu

School of Electronic and Information Engineering, Suzhou University of Science and Technology, China

School of Electronic and Information Engineering, Suzhou University of Science and Technology, China
View Profile

,
Jiaqi Zhao

School of Computer Science and Technology, China University of Mining and Technology, China

School of Computer Science and Technology, China University of Mining and Technology, China
View Profile

,
Bing Liu

School of Computer Science and Technology, China University of Mining and Technology, China

School of Computer Science and Technology, China University of Mining and Technology, China
View Profile

,
Zhiwen Shao

School of Computer Science and Technology, China University of Mining and Technology, China

School of Computer Science and Technology, China University of Mining and Technology, China
View Profile

ACM Transactions on Intelligent Systems and Technology Volume 14 Issue 6Article No.: 102pp 1–22https://doi.org/10.1145/3617067

Published:14 November 2023Publication History

ACM Transactions on Intelligent Systems and Technology

Abstract

Video Object Segmentation (VOS) methods have made many breakthroughs with the help of the continuous development and advancement of deep learning. However, the deep learning model is vulnerable to malicious adversarial attacks, which mislead the model to make wrong decisions by adding adversarial perturbation that humans cannot perceive to the input image. Threats to deep learning models remind us that video object segmentation methods are also vulnerable to attacks, thereby threatening their security. Therefore, we study adversarial attacks on the VOS task to better identify the vulnerabilities of the VOS method, which in turn provides an opportunity to improve its robustness. In this paper, we propose an attention-guided adversarial attack method, which uses spatial attention blocks to capture features with global dependencies to construct correlations between consecutive video frames, and performs multipath aggregation to effectively integrate spatial-temporal perturbation, thereby guiding the deconvolution network to generate adversarial examples with strong attack capability. Specifically, the class loss function is designed to enable the deconvolution network to better activate noise in other regions and suppress the activation related to the object class based on the enhanced feature map of the object class. At the same time, attentional feature loss is designed to enhance the transferability against attack. The experimental results on the DAVIS dataset show that the proposed attention-guided adversarial attack method can significantly reduce the segmentation accuracy of OSVOS, and the J&F mean on DAVIS 2016 can reach 73.6% drop rate. The generated adversarial examples are also highly transferable to other video object segmentation models.

REFERENCES

[1] Caelles S., Maninis K.-K., Pont-Tuset J., Leal-Taixé L., Cremers D., and Gool L. Van. 2017. One-shot video object segmentation. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17). 5320–5329. DOI:Google ScholarCross Ref
[2] Carion Nicolas, Massa Francisco, Synnaeve Gabriel, Usunier Nicolas, Kirillov Alexander, and Zagoruyko Sergey. 2020. End-to-end object detection with transformers. In European Conference on Computer Vision. Springer, 213–229.Google ScholarDigital Library
[3] Chen Xi, Li Zuoxin, Yuan Ye, Yu Gang, Shen Jianxin, and Qi Donglian. 2020. State-aware tracker for real-time video object segmentation. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20). 9381–9390. DOI:Google ScholarCross Ref
[4] Chen Yuhua, Pont-Tuset Jordi, Montes Alberto, and Gool Luc Van. 2018. Blazingly fast video object segmentation with pixel-wise metric learning. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1189–1198. DOI:Google ScholarCross Ref
[5] Cheng Jingchun, Tsai Yi-Hsuan, Hung Wei-Chih, Wang Shengjin, and Yang Ming-Hsuan. 2018. Fast and accurate online video object segmentation via tracking parts. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 7415–7424. DOI:Google ScholarCross Ref
[6] Dong Yinpeng, Liao Fangzhou, Pang Tianyu, Su Hang, Zhu Jun, Hu Xiaolin, and Li Jianguo. 2018. Boosting adversarial attacks with momentum. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 9185–9193. DOI:Google ScholarCross Ref
[7] Feng Xinyang, Song Dongjin, Chen Yuncong, Chen Zhengzhang, Ni Jingchao, and Chen Haifeng. 2021. Convolutional transformer based dual discriminator generative adversarial networks for video anomaly detection. In Proceedings of the 29th ACM International Conference on Multimedia. 5546–5554.Google ScholarDigital Library
[8] Fu Yang, Wang Xiaoyang, Wei Yunchao, and Huang Thomas. 2019. STA: Spatial-temporal attention for large-scale video-based person re-identification. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 8287–8294.Google ScholarDigital Library
[9] Goodfellow I. J., Shlens J., and Szegedy C.. 2014. Explaining and harnessing adversarial examples. Computer Science (2014).Google Scholar
[10] Guo Qing, Cheng Ziyi, Juefei-Xu Felix, Ma Lei, Xie Xiaofei, Liu Yang, and Zhao Jianjun. 2021. Learning to adversarially blur visual object tracking. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 10839–10848.Google ScholarCross Ref
[11] Guo Qing, Xie Xiaofei, Juefei-Xu Felix, Ma Lei, Li Zhongguo, Xue Wanli, Feng Wei, and Liu Yang. 2020. SPARK: Spatial-aware online incremental attack against visual tracking. In European Conference on Computer Vision. Springer, 202–219.Google ScholarDigital Library
[12] Han Junwei, Yang Le, Zhang Dingwen, Chang Xiaojun, and Liang Xiaodan. 2018. Reinforcement cutting-agent learning for video object segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 9080–9089.Google ScholarCross Ref
[13] Hu Jie, Shen Li, and Sun Gang. 2018. Squeeze-and-excitation networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18).Google ScholarCross Ref
[14] Hu P., Liu J., Wang G., Ablavsky V., and Sclaroff S.. 2020. DIPNet: Dynamic identity propagation network for video object segmentation. In IEEE Winter Conference on Applications of Computer Vision (WACV’20).Google ScholarCross Ref
[15] Hu Y. T., Huang J. B., and Schwing A. G.. 2018. MaskRNN: Instance level video object segmentation. Advances in Neural Information Processing Systems 2017-December (2018), 325–334.Google Scholar
[16] Huang Peiliang, Han Junwei, Liu Nian, Ren Jun, and Zhang Dingwen. 2021. Scribble-supervised video object segmentation. IEEE/CAA Journal of Automatica Sinica 9, 2 (2021), 339–353.Google ScholarCross Ref
[17] Huang Xuhua, Xu Jiarui, Tai Yu-Wing, and Tang Chi-Keung. 2020. Fast video object segmentation with temporal aggregation network and dynamic template matching. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20). 8876–8886. DOI:Google ScholarCross Ref
[18] Inkawhich Nathan, Inkawhich Matthew, Chen Yiran, and Li Hai. 2018. Adversarial attacks for optical flow-based action recognition classifiers. arXiv preprint arXiv:1811.11875 (2018).Google Scholar
[19] Ioffe Sergey and Szegedy Christian. 2014. Batch Normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167 (2014).Google Scholar
[20] Jabri Allan, Owens Andrew, and Efros Alexei. 2020. Space-time correspondence as a contrastive random walk. Advances in Neural Information Processing Systems 33 (2020), 19545–19560.Google Scholar
[21] Jaderberg Max, Simonyan Karen, Zisserman Andrew, and Kavukcuoglu Koray. 2015. Spatial transformer networks. Advances in Neural Information Processing Systems 28 (2015).Google Scholar
[22] Jampani Varun, Gadde Raghudeep, and Gehler Peter V.. 2017. Video propagation networks. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17). 3154–3164. DOI:Google ScholarCross Ref
[23] Jia Shuai, Song Yibing, Ma Chao, and Yang Xiaokang. 2021. IoU attack: Towards temporally coherent black-box adversarial attack for visual object tracking. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 6709–6718.Google ScholarCross Ref
[24] Jiang Linxi, Ma Xingjun, Chen Shaoxiang, Bailey James, and Jiang Yu-Gang. 2019. Black-box adversarial attacks on video recognition models. In Proceedings of the 27th ACM International Conference on Multimedia. 864–872.Google ScholarDigital Library
[25] Johnander Joakim, Danelljan Martin, Brissman Emil, Khan Fahad Shahbaz, and Felsberg Michael. 2019. A generative appearance model for end-to-end video object segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8953–8962.Google ScholarCross Ref
[26] Khoreva A., Benenson R., Ilg E., Brox T., and Schiele B.. 2019. Lucid data dreaming for video object segmentation. International Journal of Computer Vision (2019).Google ScholarDigital Library
[27] Khoreva A., Perazzi F., Benenson R., Schiele B., and Sorkine-Hornung A.. 2017. Learning video object segmentation from static images. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17).Google Scholar
[28] Krizhevsky Alex, Sutskever Ilya, and Hinton Geoffrey E.. 2012. ImageNet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems 25 (2012).Google Scholar
[29] Kumar Deepak, Kumar Chetan, Seah Chun Wei, Xia Siyu, and Shao Ming. 2020. Finding Achilles’ heel: Adversarial attack on multi-modal action recognition. In Proceedings of the 28th ACM International Conference on Multimedia. 3829–3837.Google ScholarDigital Library
[30] Kurakin A., Goodfellow I., and Bengio S.. 2016. Adversarial examples in the physical world. ArXiv abs/1607.02533 (2016).Google Scholar
[31] Li X. and Loy C. C.. 2018. Video object segmentation with joint re-identification and attention-aware mask propagation. (2018).Google Scholar
[32] Li Y., Tian D., Mingching-Chang, Bian X., and Lyu S.. 2018. Robust adversarial perturbation on deep proposal-based models. ArXiv abs/1809.05962 (2018).Google Scholar
[33] Ling Xiang, Ji Shouling, Zou Jiaxu, Wang Jiannan, Wu Chunming, Li Bo, and Wang Ting. 2019. DEEPSEC: A uniform platform for security analysis of deep learning model. In 2019 IEEE Symposium on Security and Privacy (SP’19). IEEE, 673–690.Google Scholar
[34] Liu Jinyuan, Shang Jingjie, Liu Risheng, and Fan Xin. 2022. Attention-guided global-local adversarial learning for detail-preserving multi-exposure image fusion. IEEE Transactions on Circuits and Systems for Video Technology (2022).Google Scholar
[35] Long Jonathan, Shelhamer Evan, and Darrell Trevor. 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3431–3440.Google ScholarCross Ref
[36] Lu Xiankai, Wang Wenguan, Ma Chao, Shen Jianbing, Shao Ling, and Porikli Fatih. 2019. See more, know more: Unsupervised video object segmentation with co-attention siamese networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 3623–3632.Google ScholarCross Ref
[37] Lu Xiankai, Wang Wenguan, Shen Jianbing, Tai Yu-Wing, Crandall David J., and Hoi Steven C. H.. 2020. Learning video object segmentation from unlabeled videos. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8960–8970.Google ScholarCross Ref
[38] Madry A., Makelov A., Schmidt L., Tsipras D., and Vladu A.. 2017. Towards deep learning models resistant to adversarial attacks. ArXiv abs/1706.06083 (2017).Google Scholar
[39] Mnih Volodymyr, Heess Nicolas, Graves Alex, and Kavukcuoglu Koray. 2014. Recurrent models of visual attention. In Advances in Neural Information Processing Systems, Ghahramani Z., Welling M., Cortes C., Lawrence N., and Weinberger K. Q. (Eds.), Vol. 27. Curran Associates, Inc. https://proceedings.neurips.cc/paper/2014/file/09c6c3783b4a70054da74f2538ed47c6-Paper.pdfGoogle ScholarDigital Library
[40] Moosavi-Dezfooli S. M., Fawzi A., Fawzi O., and Frossard P.. 2017. Universal adversarial perturbations. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17).Google ScholarCross Ref
[41] Nakka K. K. and Salzmann M.. 2020. Indirect local attacks for context-aware semantic segmentation networks. In European Conference on Computer Vision.Google ScholarDigital Library
[42] Oh Seoung Wug, Lee Joon-Young, Sunkavalli Kalyan, and Kim Seon Joo. 2018. Fast video object segmentation by reference-guided mask propagation. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 7376–7385. DOI:Google ScholarCross Ref
[43] Oh Seoung Wug, Lee Joon-Young, Xu Ning, and Kim Seon Joo. 2019. Video object segmentation using space-time memory networks. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV’19). 9225–9234. DOI:Google ScholarCross Ref
[44] Park Hyojin, Yoo Jayeon, Jeong Seohyeong, Venkatesh Ganesh, and Kwak Nojun. 2021. Learning dynamic network using a reuse gate function in semi-supervised video object segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8405–8414.Google ScholarCross Ref
[45] Paszke A., Gross S., Massa F., Lerer A., and Chintala S.. 2019. PyTorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems, Vol. 32. https://proceedings.neurips.cc/paper/2019/file/bdbca288fee7f92f2bfa9f7012727740-Paper.pdfGoogle Scholar
[46] Perazzi F., Pont-Tuset J., McWilliams B., Gool L. Van, Gross M., and Sorkine-Hornung A.. 2016. A benchmark dataset and evaluation methodology for video object segmentation. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16). 724–732. DOI:Google ScholarCross Ref
[47] Pont-Tuset Jordi, Perazzi Federico, Caelles Sergi, Arbeláez Pablo, Sorkine-Hornung Alexander, and Gool Luc Van. 2017. The 2017 DAVIS challenge on video object segmentation. ArXiv abs/1704.00675 (2017).Google Scholar
[48] Robinson Andreas, Lawin Felix Järemo, Danelljan Martin, Khan Fahad Shahbaz, and Felsberg Michael. 2020. Learning fast and robust target models for video object segmentation. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20). 7404–7413. DOI:Google ScholarCross Ref
[49] Seong H., Hyun J., and Kim E.. 2020. Kernelized memory network for video object segmentation. In ECCV 2020: European Conference on Computer Vision.Google ScholarDigital Library
[50] Szegedy C., Zaremba W., Sutskever I., Bruna J., Erhan D, Goodfellow I., and Fergus R.. 2013. Intriguing properties of neural networks. Computer Science (2013).Google Scholar
[51] Voigtlaender P., Chai Y., Schroff F., Adam H., Leibe B., and Chen L. C.. 2019. FEELVOS: Fast end-to-end embedding learning for video object segmentation. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’19).Google ScholarCross Ref
[52] Vondrick Carl, Shrivastava Abhinav, Fathi Alireza, Guadarrama Sergio, and Murphy Kevin. 2018. Tracking emerges by colorizing videos. In Proceedings of the European Conference on Computer Vision (ECCV’18). 391–408.Google ScholarDigital Library
[53] Wang Qiangchang, Wu Tianyi, Zheng He, and Guo Guodong. 2020. Hierarchical pyramid diverse attention networks for face recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20).Google ScholarCross Ref
[54] Wang Qiang, Zhang Li, Bertinetto Luca, Hu Weiming, and Torr Philip H. S.. 2019. Fast online object tracking and segmentation: A unifying approach. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1328–1338.Google ScholarCross Ref
[55] Wang Wenguan, Lu Xiankai, Shen Jianbing, Crandall David J., and Shao Ling. 2019. Zero-shot video object segmentation via attentive graph neural networks. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 9236–9245.Google ScholarCross Ref
[56] Wei Xingxing, Liang Siyuan, Chen Ning, and Cao Xiaochun. 2018. Transferable adversarial attacks for image and video object detection. arXiv preprint arXiv:1811.12641 (2018).Google Scholar
[57] Wei Xingxing, Zhu Jun, Yuan Sha, and Su Hang. 2019. Sparse adversarial perturbations for videos. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 8973–8980.Google ScholarDigital Library
[58] Wei Zhipeng, Chen Jingjing, Wei Xingxing, Jiang Linxi, Chua Tat-Seng, Zhou Fengfeng, and Jiang Yu-Gang. 2020. Heuristic black-box adversarial attacks on video recognition models. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 12338–12345.Google ScholarCross Ref
[59] Wei Zhipeng, Chen Jingjing, Wu Zuxuan, and Jiang Yu-Gang. 2022. Boosting the transferability of video adversarial examples via temporal translation. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 2659–2667.Google ScholarCross Ref
[60] Wiyatno Rey and Xu Anqi. 2019. Physical adversarial textures that fool visual object tracking. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV’19). 4821–4830. DOI:Google ScholarCross Ref
[61] Xie C., Wang J., Zhang Z., Zhou Y., and Yuille A.. 2017. Adversarial examples for semantic segmentation and object detection. In 2017 IEEE International Conference on Computer Vision (ICCV’17).Google ScholarCross Ref
[62] Xie Saining and Tu Zhuowen. 2015. Holistically-nested edge detection. In 2015 IEEE International Conference on Computer Vision (ICCV’15). 1395–1403. DOI:Google ScholarDigital Library
[63] Yang Linjie, Wang Yanran, Xiong Xuehan, Yang Jianchao, and Katsaggelos Aggelos K.. 2018. Efficient video object segmentation via network modulation. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 6499–6507. DOI:Google ScholarCross Ref
[64] Yang Zongxin, Wei Yunchao, and Yang Yi. 2020. Collaborative video object segmentation by foreground-background integration. In European Conference on Computer Vision. Springer, 332–348.Google ScholarDigital Library
[65] Yoon Jae Shin, Rameau Francois, Kim Junsik, Lee Seokju, Shin Seunghak, and Kweon In So. 2017. Pixel-level matching for video object segmentation using convolutional neural networks. In 2017 IEEE International Conference on Computer Vision (ICCV’17). 2186–2195. DOI:Google ScholarCross Ref
[66] Zeng Xiaohui, Liao Renjie, Gu Li, Xiong Yuwen, Fidler Sanja, and Urtasun Raquel. 2019. DMM-Net: Differentiable mask-matching network for video object segmentation. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV’19). 3928–3937. DOI:Google ScholarCross Ref
[67] Zhang Dingwen, Han Junwei, Yang Le, and Xu Dong. 2018. SPFTN: A joint learning framework for localizing and segmenting objects in weakly labeled videos. IEEE Transactions on Pattern Analysis and Machine Intelligence 42, 2 (2018), 475–489.Google ScholarCross Ref
[68] Zhang Weichao, Wang Guanjun, Huang Mengxing, Wang Hongyu, and Wen Shaoping. 2021. Generative adversarial networks for abnormal event detection in videos based on self-attention mechanism. IEEE Access 9 (2021), 124847–124860.Google ScholarCross Ref
[69] Zhang Yizhuo, Wu Zhirong, Peng Houwen, and Lin Stephen. 2020. A transductive approach for video object segmentation. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20). 6947–6956. DOI:Google ScholarCross Ref
[70] Zhao Y., Zhu H., Liang R., Shen Q., Zhang S., and Chen K.. 2018. Seeing isn’t believing: Practical adversarial attack against object detectors. ArXiv abs/1812.10217 (2018).Google Scholar

Index Terms

Attention-guided Adversarial Attack for Video Object Segmentation
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision

Recommendations

Exploring the Adversarial Robustness of Video Object Segmentation via One-shot Adversarial Attacks
MM '23: Proceedings of the 31st ACM International Conference on Multimedia

Video object segmentation (VOS) is a fundamental task for computer vision and multimedia. Despite significant progress of VOS models in recent works, there has been little research on the VOS models' adversarial robustness, posing serious security risks ...
Read More
PlAA: Pixel-level Adversarial Attack on Attention for Deep Neural Network
Artificial Neural Networks and Machine Learning – ICANN 2022
Abstract
Deep Neural Networks (DNNs) have demonstrated excellent performance in many fields. However, existing studies have shown that deep neural networks are very susceptible to well-designed adversarial samples. Adversarial samples cause the system to ...
Read More
Adversarial Attack against Modeling Attack on PUFs
DAC '19: Proceedings of the 56th Annual Design Automation Conference 2019

The Physical Unclonable Function (PUF) has been proposed for the identification and authentication of devices and cryptographic key generation. A strong PUF provides an extremely large number of device-specific challenge-response pairs (CRP) which can ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM Transactions on Intelligent Systems and Technology Volume 14, Issue 6
December 2023
493 pages
ISSN:2157-6904
EISSN:2157-6912
DOI:10.1145/3632517
Editor:
Huan Liu
Arizona State University, USA
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 14 November 2023
- Online AM: 2 September 2023
- Accepted: 11 August 2023
- Revised: 23 January 2023
- Received: 12 January 2022
Published in tist Volume 14, Issue 6

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Video object segmentation
adversarial attack
attention-guided
deconvolution network
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 230
  Total Downloads
- Downloads (Last 12 months)230
- Downloads (Last 6 weeks)35
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

View Full Text

Attention-guided Adversarial Attack for Video Object Segmentation

ACM Transactions on Intelligent Systems and Technology

Abstract

REFERENCES

Cited By

Index Terms

Recommendations

Exploring the Adversarial Robustness of Video Object Segmentation via One-shot Adversarial Attacks

PlAA: Pixel-level Adversarial Attack on Attention for Deep Neural Network

Adversarial Attack against Modeling Attack on PUFs

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Full Text

Caption

Attention-guided Adversarial Attack for Video Object Segmentation

ACM Transactions on Intelligent Systems and Technology

Abstract

REFERENCES

Cited By

Index Terms

Recommendations

Exploring the Adversarial Robustness of Video Object Segmentation via One-shot Adversarial Attacks

PlAA: Pixel-level Adversarial Attack on Attention for Deep Neural Network

Adversarial Attack against Modeling Attack on PUFs

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Full Text

Share this Publication link

Share on Social Media