BriefGPT.xyz
Jun, 2020
平滑几何用于鲁棒归因
Smoothed Geometry for Robust Attribution
HTML
PDF
Zifan Wang, Haofan Wang, Shakul Ramkumar, Matt Fredrikson, Piotr Mardziel...
TL;DR
该文章提出了一种用于改善深度神经网络中当前解释工具易受攻击的局限性的正则化方法(包括Lipschitz连续性的条件)和随机平滑技术,并在各种图像模型上进行实验以验证其效果和证明平滑几何在这些对真实大规模模型的攻击中所起的作用。
Abstract
Feature attributions are a popular tool for explaining the behavior of
deep neural networks
(DNNs), but have recently been shown to be vulnerable to attacks that produce divergent explanations for nearby inputs. This lack of
→