BriefGPT.xyz
Jul, 2023
通过潜在空间分解揭示独特的概念向量
Uncovering Unique Concept Vectors through Latent Space Decomposition
HTML
PDF
Mara Graziani, Laura O' Mahony, An-Phi Nguyen, Henning Müller, Vincent Andrearczyk
TL;DR
该论文提出一种后期无监督方法,通过分解和聚类方法,自动发现深度学习模型中的概念向量,从而支持可解释性分析,可以成功鉴别与疏离数据有关的训练样本
Abstract
Interpreting the inner workings of
deep learning models
is crucial for establishing trust and ensuring model safety.
concept-based explanations
have emerged as a superior approach that is more interpretable than
→