BriefGPT.xyz
May, 2022
可解释性评估指标的可求解性
The Solvability of Interpretability Evaluation Metrics
HTML
PDF
Yilun Zhou, Julie Shah
TL;DR
本文介绍了一个解释神经网络预测的特征归因方法,提出了一个问题:为什么我们不使用解释器(例如LIME),而是基于解决度量来优化解释,如果度量值代表了解释质量呢?我们实现了解释器,并发布了Python solvex包,可用于文本、图像和表格等领域的模型。
Abstract
feature attribution methods
are popular for explaining
neural network predictions
, and they are often evaluated on metrics such as comprehensiveness and sufficiency, which are motivated by the principle that more
→