BriefGPT.xyz
Oct, 2023
标签污染就是您所需要的
Label Poisoning is All You Need
HTML
PDF
Rishi D. Jha, Jonathan Hayase, Sewoong Oh
TL;DR
通过corrupt labels设计的label-only backdoor attack方法FLIP,在几个数据集和架构上展示了其强大的攻击能力,并且只引起1.8%的clean test准确度下降。
Abstract
In a
backdoor attack
, an adversary injects
corrupted data
into a model's training dataset in order to gain control over its predictions on images with a specific attacker-defined trigger. A typical corrupted trai
→