深度策略对抗攻击探究

May, 2017

Delving into adversarial attacks on deep policies

Jernej Kos, Dawn Song

TL;DR本文探究了深度强化学习中的对抗攻击，比较了使用对抗样本和随机噪声攻击的有效性，并提出了一种新的基于价值函数的方法来降低攻击的成功次数。此外，本文还研究了随机噪声和FGSM扰动对对抗攻击韧性的影响。

Abstract

adversarial examples have been shown to exist for a variety of deep learning architectures. deep reinforcement learning has shown promising results on training agent policies directly on raw inputs such as image