BriefGPT.xyz
Oct, 2020
使用生成模型的可传递通用对抗扰动
Transferable Universal Adversarial Perturbations Using Generative Models
HTML
PDF
Atiye Sadat Hashemi, Andreas Bär, Saeed Mozaffari, Tim Fingscheidt
TL;DR
本文提出了一种针对深度网络易受对抗扰动攻击的问题的方法,使用预训练网络生成图像无关对抗扰动(UAPs),并通过在源模型的第一层仅关注对抗能量的损失,提高了UAPs的传递性,并在实验中展示了该方法的优越性。
Abstract
Deep neural networks tend to be vulnerable to
adversarial perturbations
, which by adding to a natural image can fool a respective model with high confidence. Recently, the existence of image-agnostic perturbations, also known as universal
→