Geometric Properties of Adversarial Examples

Nov, 2018

On the Geometry of Adversarial Examples

Marc Khoury, Dylan Hadfield-Menell

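TL;DR: This work proposes a geometric framework, built on manifold reconstruction methods, for analyzing the high-dimensional geometry of adversarial examples. It proves results on robustness under different norms, on the number of samples needed for ball-based adversarial training, and on sufficient sampling conditions for nearest-neighbor classifiers and ball-based adversarial training.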
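
As a rough illustration of what a sampling condition for nearest-neighbor robustness can look like, the sketch below builds a toy problem that is an assumption of this example and is not taken from the paper: two classes lie on parallel one-dimensional segments embedded in a 50-dimensional space, and each segment is sampled on an evenly spaced grid. The function names (`sample_segments`, `perturbed_query`, `nearest_neighbor_predict`) and the constants `DIM`, `GAP`, and `EPS` are hypothetical. In this setup a query within distance `EPS` of the class-0 segment is at most `EPS + spacing / 2` from a class-0 sample and at least `GAP - EPS` from every class-1 sample, so a 1-nearest-neighbor classifier labels it correctly whenever `EPS + spacing / 2 < GAP - EPS`.

```python
import math
import random


def nearest_neighbor_predict(train, query):
    """Return the label of the training point closest to query (1-NN)."""
    best_label, best_dist = None, math.inf
    for point, label in train:
        dist = math.dist(point, query)
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def sample_segments(n_per_class, dim, gap):
    """Evenly sample two parallel unit segments embedded in R^dim.

    Class 0 lies on the first axis; class 1 is the same segment
    shifted by `gap` along the second axis.
    """
    train = []
    for i in range(n_per_class):
        t = i / (n_per_class - 1)
        train.append(([t] + [0.0] * (dim - 1), 0))
        train.append(([t, gap] + [0.0] * (dim - 2), 1))
    return train


def perturbed_query(rng, dim, eps):
    """Pick a random point on the class-0 segment and move it by a
    random vector of Euclidean norm eps (mostly off the segment)."""
    base_t = rng.random()
    direction = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(c * c for c in direction))
    query = [eps * c / norm for c in direction]
    query[0] += base_t
    return query


rng = random.Random(0)
DIM, GAP, EPS = 50, 2.0, 0.5            # toy values, chosen for illustration
train = sample_segments(11, DIM, GAP)   # grid spacing 0.1 on each segment
queries = [perturbed_query(rng, DIM, EPS) for _ in range(200)]
predictions = [nearest_neighbor_predict(train, q) for q in queries]
robust_fraction = predictions.count(0) / len(predictions)
print(robust_fraction)
```

With the values above the inequality holds with a wide margin (0.55 versus 1.5), so every perturbed query keeps its class-0 label. Making the grid much coarser or `EPS` close to `GAP / 2` is where the guarantee in this toy setting stops applying.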

Abstract

Adversarial examples are a pervasive phenomenon of machine learning models where seemingly imperceptible perturbations to the input lead to misclassifications for otherwise statistically accurate models. We propo