利用模型 knockoffs 实现基于概念的图像分类器显著性解释

May, 2023

Statistically Significant Concept-based Explanation of Image Classifiers via Model Knockoffs

Kaiwen Xu, Kazuto Fukuchi, Youhei Akimoto, Jun Sakuma

TL;DR本研究提出了一种利用深度学习模型学习图像概念，然后使用Knockoff样本选择重要概念进行预测的方法来控制误发现率，实验结果表明该方法可以控制FDR并选择高度可解释的概念来提高模型的可靠性。

Abstract

A concept-based classifier can explain the decision process of a deep learning model by human-understandable concepts in image classification