BriefGPT.xyz
Oct, 2021
文本风格注意!基于文本风格转换的对抗和后门攻击
Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer
HTML
PDF
Fanchao Qi, Yangyi Chen, Xurui Zhang, Mukai Li, Zhiyuan Liu...
TL;DR
本研究首次尝试基于文本样式转换进行对抗性和后门攻击,设计了对抗性攻击和后门攻击方法,并进行了广泛实验来评估它们。实验结果表明,基于文本样式转换的对抗性和后门攻击方法优于基线模型,在许多方面都表现出卓越的优越性。
Abstract
adversarial attacks
and
backdoor attacks
are two common security threats that hang over deep learning. Both of them harness task-irrelevant features of data in their implementation. Text style is a feature that i
→