BriefGPT.xyz
May, 2017
深度策略对抗攻击探究
Delving into adversarial attacks on deep policies
HTML
PDF
Jernej Kos, Dawn Song
TL;DR
本文探究了深度强化学习中的对抗攻击,比较了使用对抗样本和随机噪声攻击的有效性,并提出了一种新的基于价值函数的方法来降低攻击的成功次数。此外,本文还研究了随机噪声和FGSM扰动对对抗攻击韧性的影响。
Abstract
adversarial examples
have been shown to exist for a variety of deep learning architectures.
deep reinforcement learning
has shown promising results on training agent policies directly on raw inputs such as image
→