BriefGPT.xyz
May, 2019
鲁棒归因正则化
Robust Attribution Regularization
HTML
PDF
Jiefeng Chen, Xi Wu, Vaibhav Rastogi, Yingyu Liang, Somesh Jha
TL;DR
通过公理归因神经网络的视角,我们提出了经典鲁棒优化模型的训练目标,旨在实现鲁棒的集成梯度归因。实验结果表明了我们方法的有效性,并表明需要更好的优化技术或更好的神经网络架构来进行鲁棒的归因训练。
Abstract
An emerging problem in
trustworthy machine learning
is to train models that produce robust interpretations for their predictions. We take a step towards solving this problem through the lens of
axiomatic attribution
→