BriefGPT.xyz
Jan, 2019
对抗训练的局限性和盲点攻击
The Limitations of Adversarial Training and the Blind-Spot Attack
HTML
PDF
Huan Zhang, Hongge Chen, Zhao Song, Duane Boning, Inderjit S. Dhillon...
TL;DR
本文研究了对抗训练的实用性和难度,发现对抗训练的有效性与测试数据点与网络嵌入的训练数据流形之间的距离有强烈相关性,离流形越远的测试数据点越容易受到对抗攻击,并提出了新型攻击——“盲点攻击”,在任何真实数据点可能存在,其对大规模复杂数据的对抗性训练提出了挑战。
Abstract
The
adversarial training
procedure proposed by Madry et al. (2018) is one of the most effective methods to defend against adversarial examples in
deep neural networks
(DNNs). In our paper, we shed some lights on
→