BriefGPT.xyz
Mar, 2023
Understanding and Unifying Fourteen Attribution Methods with Taylor Interactions
Huiqi Deng, Na Zou, Mengnan Du, Weifu Chen, Guocan Feng...
TL;DR
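This paper is the first to unify the core mechanisms of 14 heuristically designed attribution methods into a single mathematical system. It proves that the attribution scores of all 14 methods can be reformulated as a weighted sum of two types of effects: the independent effect of each input variable and the interaction effects between input variables. It also proposes three principles for fairly allocating these effects, which are used to evaluate the faithfulness of the 14 attribution methods.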
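As a rough illustration of the "independent effect plus interaction effect" decomposition, the sketch below uses a hypothetical two-variable function `f` (not taken from the paper) with a zero baseline. It computes exact Shapley values by averaging marginal contributions over all variable orderings, then checks that, for this toy function, each attribution equals the variable's own independent effect plus half of the shared interaction effect. The function, the input point, and the helper name `shapley` are all assumptions made for the example.

```python
from itertools import permutations

# Hypothetical toy model (an assumption for illustration):
# two independent terms plus one pairwise interaction term.
def f(x1, x2):
    return 2.0 * x1 + 3.0 * x2 + 4.0 * x1 * x2

def shapley(func, x, baseline):
    """Exact Shapley values: average marginal contribution over all orderings."""
    n = len(x)
    totals = [0.0] * n
    orders = list(permutations(range(n)))
    for order in orders:
        current = list(baseline)      # start from the baseline input
        prev = func(*current)
        for i in order:
            current[i] = x[i]         # switch variable i to its actual value
            now = func(*current)
            totals[i] += now - prev   # marginal contribution of variable i
            prev = now
    return [t / len(orders) for t in totals]

x = (1.0, 2.0)
baseline = (0.0, 0.0)

# Effects read off directly from the toy function at this input.
independent = [2.0 * x[0], 3.0 * x[1]]   # each variable acting alone
interaction = 4.0 * x[0] * x[1]          # effect shared by x1 and x2

attributions = shapley(f, x, baseline)
for i, a in enumerate(attributions):
    print(f"x{i + 1}: attribution={a}, independent={independent[i]}, "
          f"share of interaction={a - independent[i]}")
```

In this toy case the interaction effect is split evenly between the two variables, and the attributions sum to `f(x) - f(baseline)`. Other attribution methods can be read as choosing different weights for the same two kinds of effects.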
Abstract
Various attribution methods have been developed to explain deep neural networks (DNNs) by inferring the attribution/importance/contribution score of each input variable to the final output. However, existing
→