BriefGPT.xyz
Apr, 2024
SIDEs: 在xAI中将理想化与欺骗性解释分开
SIDEs: Separating Idealization from Deceptive Explanations in xAI
HTML
PDF
Emily Sullivan
TL;DR
解释性人工智能(xAI)方法对于建立信任使用黑匣子模型至关重要,本文概述了xAI研究参与理想化评估的需求,并介绍了一个评估xAI方法是否成功理想化或具有欺骗性解释的新框架。我们的定性分析发现,领先的特征重要性方法和反事实解释容易出现理想化失效,并提出了改进理想化失效的方法。
Abstract
explainable ai
(xAI) methods are important for establishing trust in using
black-box models
. However, recent criticism has mounted against current xAI methods that they disagree, are necessarily false, and can be
→