May, 2017

MagNet:对抗样本的双重防御

TL;DRMagNet is proposed as a defense mechanism for neural network classifiers against adversarial examples in deep learning, which learns to differentiate between normal and adversarial examples by approximating the manifold of normal examples and reconstructing adversarial examples by moving them towards the manifold, and it also proposes a mechanism to defend against graybox attack by using diversity to strengthen MagNet.