SurroCBM: Concept Bottleneck Surrogate Models for Generative Post-hoc Explanation

Pan, Bo; Liu, Zhenke; Zhang, Yifei; Zhao, Liang

Computer Science > Artificial Intelligence

arXiv:2310.07698 (cs)

[Submitted on 11 Oct 2023]

Abstract: Explainable AI seeks to shed light on the decision-making processes of black-box models. Traditional saliency-based methods highlight influential data segments but often lack semantic understanding. Recent advancements, such as Concept Activation Vectors (CAVs) and Concept Bottleneck Models (CBMs), offer concept-based explanations but require human-defined concepts, which are expensive to obtain. This paper introduces Concept Bottleneck Surrogate Models (SurroCBM), a novel framework that aims to explain black-box models with automatically discovered concepts. SurroCBM identifies shared and unique concepts across various black-box models and employs an explainable surrogate model for post-hoc explanations. An effective training strategy using self-generated data is proposed to enhance explanation quality continuously. Through extensive experiments, we demonstrate the efficacy of SurroCBM in concept discovery and explanation, underscoring its potential in advancing the field of explainable AI.
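
The general idea of a concept-based surrogate can be illustrated with a toy sketch. This is not the SurroCBM implementation: the black box, the two hand-written concepts (`bright_left`, `bright_right`), and the lookup-table surrogate are all hypothetical stand-ins chosen for brevity, whereas the abstract describes concepts that are discovered automatically. The sketch only shows the post-hoc pattern of querying a black box, mapping inputs to concepts, fitting an interpretable model on those concepts, and measuring how often it agrees with the black box (fidelity).

```python
from collections import Counter, defaultdict
from itertools import product


def black_box(x):
    """Hypothetical opaque classifier; we may only query its outputs."""
    return int(x[0] + x[1] > 1.0)


def concepts(x):
    """Hand-written binary concepts (an assumption for this sketch).

    SurroCBM, per the abstract, discovers concepts automatically instead.
    """
    bright_left = int((x[0] + x[1]) / 2 > 0.5)
    bright_right = int((x[2] + x[3]) / 2 > 0.5)
    return (bright_left, bright_right)


def fit_surrogate(inputs):
    """Fit a lookup table: concept tuple -> majority black-box label."""
    votes = defaultdict(Counter)
    for x in inputs:
        votes[concepts(x)][black_box(x)] += 1
    return {key: counts.most_common(1)[0][0] for key, counts in votes.items()}


def fidelity(table, inputs):
    """Fraction of inputs where the surrogate agrees with the black box."""
    agree = sum(table.get(concepts(x), 0) == black_box(x) for x in inputs)
    return agree / len(inputs)


# Deterministic grids of 4-value inputs stand in for real data.
train = list(product([0.0, 0.25, 0.5, 0.75, 1.0], repeat=4))
held_out = list(product([0.1, 0.4, 0.6, 0.9], repeat=4))

table = fit_surrogate(train)
score = fidelity(table, held_out)

# The table itself is the explanation: it shows which concepts drive the label.
for key in sorted(table):
    print(f"bright_left={key[0]} bright_right={key[1]} -> predicts {table[key]}")
print(f"fidelity on held-out inputs: {score:.2f}")
```

In this toy setup the fitted table makes it visible that only the first concept influences the black box, which is the kind of statement a concept-level surrogate is meant to support.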

Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2310.07698 [cs.AI]
	(or arXiv:2310.07698v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2310.07698

Submission history

From: Bo Pan
[v1] Wed, 11 Oct 2023 17:46:59 UTC (2,504 KB)
