BriefGPT.xyz
May, 2022
输入空间到特征表示的无感后门攻击
Imperceptible Backdoor Attack: From Input Space to Feature Representation
HTML
PDF
Nan Zhong, Zhenxing Qian, Xinpeng Zhang
TL;DR
本文提出了一种新颖的隐形后门攻击方法,该方法通过将触发器模式视为一种特殊噪声并以伯努利分布生成参数,从而在不影响正常输入的情况下利用训练集合并夹杂恶意信息,并考虑对多种最新防御措施的效果验证。
Abstract
backdoor attacks
are rapidly emerging threats to
deep neural networks
(DNNs). In the backdoor attack scenario, attackers usually implant the backdoor into the target model by manipulating the training dataset or
→