skip to main content
10.1145/3539618.3592070acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

The Dark Side of Explanations: Poisoning Recommender Systems with Counterfactual Examples

Published:18 July 2023Publication History

ABSTRACT

Deep learning-based recommender systems have become an integral part of several online platforms. However, their black-box nature emphasizes the need for explainable artificial intelligence (XAI) approaches to provide human-understandable reasons why a specific item gets recommended to a given user. One such method is counterfactual explanation (CF). While CFs can be highly beneficial for users and system designers, malicious actors may also exploit these explanations to undermine the system's security.

In this work, we propose H-CARS, a novel strategy to poison recommender systems via CFs. Specifically, we first train a logical-reasoning-based surrogate model on training data derived from counterfactual explanations. By reversing the learning process of the recommendation model, we thus develop a proficient greedy algorithm to generate fabricated user profiles and their associated interaction records for the aforementioned surrogate model. Our experiments, which employ a well-known CF generation method and are conducted on two distinct datasets, show that H-CARS yields significant and successful attack performance.

References

  1. Ulrich Aïvodji, Alexandre Bolot, and Sébastien Gambs. 2020. Model Extraction from Counterfactual Explanations. arXiv preprint arXiv:2009.01884 (2020).Google ScholarGoogle Scholar
  2. Hanxiong Chen, Shaoyun Shi, Yunqi Li, and Yongfeng Zhang. 2021. Neural Collaborative Reasoning. In Proc. of TheWebConf '21. 1516--1527.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Ziheng Chen, Fabrizio Silvestri, Jia Wang, Yongfeng Zhang, Zhenhua Huang, Hongshik Ahn, and Gabriele Tolomei. 2022. GREASE: Generate Factual and Counterfactual Explanations for GNN-based Recommendations. arXiv preprint arXiv:2208.04222 (2022).Google ScholarGoogle Scholar
  4. Ziheng Chen, Fabrizio Silvestri, Jia Wang, He Zhu, Hongshik Ahn, and Gabriele Tolomei. 2022. ReLAX: Reinforcement Learning Agent Explainer for Arbitrary Predictive Models. In Proc. of CIKM '22. ACM, 252--261.Google ScholarGoogle Scholar
  5. Vasisht Duddu and Antoine Boutet. 2022. Inferring Sensitive Attributes from Model Explanations. In Proc. of CIKM '22. ACM, 416--425.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Songyang Han, Sanbao Su, Sihong He, Shuo Han, Haizhao Yang, and Fei Miao. 2022. What is the Solution for State Adversarial Multi-Agent Reinforcement Learning? arXiv preprint arXiv:2212.02705 (2022).Google ScholarGoogle Scholar
  7. Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua. 2017. Neural Collaborative Filtering. In Proc. of WWW '17. 173--182.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Hai Huang, Jiaming Mu, Neil Zhenqiang Gong, Qi Li, Bin Liu, and Mingwei Xu. 2021. Data Poisoning Attacks to Deep Learning Based Recommender Systems. arXiv preprint arXiv:2101.02644 (2021).Google ScholarGoogle Scholar
  9. Ruoming Jin, Dong Li, Jing Gao, Zhi Liu, Li Chen, and Yang Zhou. 2021. Towards a Better Understanding of Linear Models for Recommendation. In Proc. of KDD'21. ACM, 776--785.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Amir-Hossein Karimi, Gilles Barthe, Borja Balle, and Isabel Valera. 2020. Model-Agnostic Counterfactual Explanations for Consequential Decisions. In Proc. of AISTATS '20, Vol. 108. PMLR, 895--905.Google ScholarGoogle Scholar
  11. Ruoyan Kong, Haiyi Zhu, and Joseph A Konstan. 2021. Learning to Ignore: A Case Study of Organization-Wide Bulk Email Effectiveness. Proc. of the ACM on Human-Computer Interaction 5, CSCW1 (2021), 1--23.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Thai Le, Suhang Wang, and Dongwon Lee. 2020. GRACE: Generating Concise and Informative Contrastive Sample to Explain Neural Network Model's Prediction. In Proc. of KDD '20. ACM, 238--248.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Dong Li, Ruoming Jin, Jing Gao, and Zhi Liu. 2020. On Sampling Top-k Recommendation Evaluation. In Proc. of KDD '20. ACM, 2114--2124.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Wei Li, Li Fan, Zhenyu Wang, Chao Ma, and Xiaohui Cui. 2021. Tackling Mode Collapse in Multi-Generator GANs with Orthogonal Vectors. Pattern Recognition 110 (2021), 107646.Google ScholarGoogle ScholarCross RefCross Ref
  15. Wei Li, Zhixuan Liang, Ping Ma, Ruobei Wang, Xiaohui Cui, and Ping Chen. 2021. Hausdorff GAN: Improving GAN Generation Quality with Hausdorff Metric. IEEE Transactions on Cybernetics (2021).Google ScholarGoogle Scholar
  16. Xiaohan Li, Zheng Liu, Luyi Ma, Kaushiki Nag, Stephen Guo, S Yu Philip, and Kannan Achan. 2022. Mitigating Frequency Bias in Next-Basket Recommendation via Deconfounders. In Proc. of BigData '22. IEEE, 616--625.Google ScholarGoogle ScholarCross RefCross Ref
  17. Xiaohan Li, Mengqi Zhang, Shu Wu, Zheng Liu, Liang Wang, and S Yu Philip. 2020. Dynamic Graph Collaborative Filtering. In Proc. of ICDM '20. IEEE, 322--331.Google ScholarGoogle ScholarCross RefCross Ref
  18. Ana Lucic, Harrie Oosterhuis, Hinda Haned, and Maarten de Rijke. 2022. FOCUS: Flexible Optimizable Counterfactual Explanations for Tree Ensembles. In Proc. of AAAI '22. AAAI Press, 5313--5322.Google ScholarGoogle Scholar
  19. Ramaravind Kommiya Mothilal, Amit Sharma, and Chenhao Tan. 2020. Explaining Machine Learning Classifiers through Diverse Counterfactual Explanations. In Proc. of FAT* '20. ACM, 607--617.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Shanlei Mu, Yaliang Li, Wayne Xin Zhao, Jingyuan Wang, Bolin Ding, and Ji-Rong Wen. 2022. Alleviating Spurious Correlations in Knowledge-Aware Recommendations through Counterfactual Generator. In Proc. of SIGIR '22. ACM, 1401--1411.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Martin Pawelczyk, Himabindu Lakkaraju, and Seth Neel. 2022. On the Privacy Risks of Algorithmic Recourse. arXiv preprint arXiv:2211.05427 (2022).Google ScholarGoogle Scholar
  22. Federico Siciliano, Maria Sofia Bucarelli, Gabriele Tolomei, and Fabrizio Silvestri. 2022. NEWRON: A New Generalization of the Artificial Neuron to Enhance the Interpretability of Neural Networks. In Proc. of IJCNN '22. IEEE, 1--17.Google ScholarGoogle ScholarCross RefCross Ref
  23. Jiaxi Tang, Hongyi Wen, and Ke Wang. 2020. Revisiting Adversarially Learned Injection Attacks against Recommender Systems. In Proc. of RecSys '20. ACM, 318--327.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Gabriele Tolomei and Fabrizio Silvestri. 2021. Generating Actionable Interpretations from Ensembles of Decision Trees. IEEE TKDE 33, 4 (2021), 1540--1553.Google ScholarGoogle Scholar
  25. Gabriele Tolomei, Fabrizio Silvestri, Andrew Haines, and Mounia Lalmas. 2017. Interpretable Predictions of Tree-based Ensembles via Actionable Feature Tweaking. In Proc. of KDD '17. ACM, 465--474.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Khanh Hiep Tran, Azin Ghazimatin, and Rishiraj Saha Roy. 2021. Counterfactual Explanations for Neural Recommenders. In Proc. of SIGIR '21. ACM, 1627--1631.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Xinghua Wang, Zhaohui Peng, Senzhang Wang, Philip S Yu, Wenjing Fu, Xiaokang Xu, and Xiaoguang Hong. 2020. CDLFM: Cross-Domain Recommendation for Cold-Start Users via Latent Feature Mapping. Knowledge and Information Systems 62 (2020), 1723--1750.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Yongjie Wang, Hangwei Qian, and Chunyan Miao. 2022. DualCF: Efficient Model Extraction Attack from Counterfactual Explanations. In Proc. of FAccT '22. ACM, 1318--1329.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Xin Xin, Xiangnan He, Yongfeng Zhang, Yongdong Zhang, and Joemon Jose. 2019. Relational Collaborative Filtering: Modeling Multiple Item Relations for Recommendation. In Proc. of SIGIR '19. ACM, 125--134.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Hengtong Zhang, Changxin Tian, Yaliang Li, Lu Su, Nan Yang, Wayne Xin Zhao, and Jing Gao. 2021. Data Poisoning Attack against Recommender System Using Incomplete and Perturbed Data. In Proc. of KDD '21. ACM, 2154--2164.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Hongke Zhao, Qi Liu, Yong Ge, Ruoyan Kong, and Enhong Chen. 2016. Group Preference Aggregation: A Nash Equilibrium Approach. In Proc. of ICDM '16. IEEE, 679--688.Google ScholarGoogle ScholarCross RefCross Ref
  32. Xuejun Zhao, Wencan Zhang, Xiaokui Xiao, and Brian Lim. 2021. Exploiting Explanations for Model Inversion Attacks. In Proc. of ICCV '21. IEEE, 682--692.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. The Dark Side of Explanations: Poisoning Recommender Systems with Counterfactual Examples

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval
      July 2023
      3567 pages
      ISBN:9781450394086
      DOI:10.1145/3539618

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 18 July 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper

      Acceptance Rates

      Overall Acceptance Rate792of3,983submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader