BriefGPT.xyz
Sep, 2023
探索稳健特征以提升对抗性鲁棒性
Exploring Robust Features for Improving Adversarial Robustness
HTML
PDF
Hong Wang, Yuefan Deng, Shinjae Yoo, Yuewei Lin
TL;DR
通过分离鲁棒特征和领域特定特征,我们提出了一个鲁棒特征分离模型,以增强模型的对抗性鲁棒性。同时,训练出的领域鉴别器几乎完美地能够识别出干净图像和对抗性样本中的领域特定特征,从而无需额外的计算成本实现对抗性样本的检测。这样一来,我们可以为干净图像和对抗性样本指定不同的分类器,避免了干净图像准确率的降低。
Abstract
While
deep neural networks
(DNNs) have revolutionized many fields, their fragility to carefully designed
adversarial attacks
impedes the usage of DNNs in safety-critical applications. In this paper, we strive to
→