BriefGPT.xyz
Apr, 2022
动态调节对抗性对手的对抗微调
Adversarial Fine-tune with Dynamically Regulated Adversary
HTML
PDF
Pengyue Hou, Ming Zhou, Jie Han, Petr Musilek, Xingyu Li
TL;DR
本文提出了一种简单而有效的基于迁移学习的对抗性训练策略,该策略将对抗样本的负面影响与模型的标准性能分离开来,引入了一种训练友好的对抗攻击算法,同时保持了模型对干净数据的标准性能,从而提高了模型的鲁棒性。
Abstract
adversarial training
is an effective method to boost
model robustness
to malicious, adversarial attacks. However, such improvement in
model robus
→