BriefGPT.xyz
Oct, 2020
面向属性鲁棒性的通用框架 FAR
FAR: A General Framework for Attributional Robustness
HTML
PDF
Adam Ivankay, Ivan Girardi, Chiara Marchiori, Pascal Frossard
TL;DR
该研究提出一种名称为FAR的新型范式,用于通过在输入的局部领域内最小化属性映射的最大差异来训练模型的鲁棒属性。通过新模型AAT和AdvAAT的实验表明,所提出的方法在对抗干扰下都更有稳健性。
Abstract
attribution maps
have gained popularity as tools for explaining
neural networks
predictions. By assigning an importance value to each input dimension that represents their influence towards the outcome, they give
→