Towards Interpreting and Utilizing Symmetry Property in Adversarial Examples

Authors

  • Shibin Mei Shanghai Jiao Tong University
  • Chenglong Zhao Shanghai Jiao Tong University
  • Bingbing Ni Shanghai Jiao Tong University
  • Shengchao Yuan Shanghai Jiao Tong University

DOI:

https://doi.org/10.1609/aaai.v37i8.26095

Keywords:

ML: Representation Learning, CV: Applications

Abstract

In this paper, we identify symmetry property in adversarial scenario by viewing adversarial attack in a fine-grained manner. A newly designed metric called attack proportion, is thus proposed to count the proportion of the adversarial examples misclassified between classes. We observe that the distribution of attack proportion is unbalanced as each class shows vulnerability to particular classes. Further, some class pairs correlate strongly and have the same degree of attack proportion for each other. We call this intriguing phenomenon symmetry property. We empirically prove this phenomenon is widespread and then analyze the reason behind the existence of symmetry property. This explanation, to some extent, could be utilized to understand robust models, which also inspires us to strengthen adversarial defenses.

Downloads

Published

2023-06-26

How to Cite

Mei, S., Zhao, C., Ni, B., & Yuan, S. (2023). Towards Interpreting and Utilizing Symmetry Property in Adversarial Examples. Proceedings of the AAAI Conference on Artificial Intelligence, 37(8), 9126-9133. https://doi.org/10.1609/aaai.v37i8.26095

Issue

Section

AAAI Technical Track on Machine Learning III