BriefGPT.xyz
Feb, 2023
在对抗训练中提高最劣鲁棒性
WAT: Improve the Worst-class Robustness in Adversarial Training
HTML
PDF
Boqi Li, Weiwei Liu
TL;DR
本文提出了一种最差类对抗训练(worst-class adversarial training)的新框架,利用无悔动态来解决对抗样本攻击的问题,旨在获得在最差情况下表现优异的分类器,并在同时仅牺牲少量平均鲁棒性。作者在各种数据集和网络上的实验证明了该方法超越了现有方法。
Abstract
deep neural networks
(DNN) have been shown to be vulnerable to
adversarial examples
.
adversarial training
(AT) is a popular and effective
→