BriefGPT.xyz
Sep, 2022
深度强化学习的白盒对抗策略
White-Box Adversarial Policies in Deep Reinforcement Learning
HTML
PDF
Stephen Casper, Dylan Hadfield-Menell, Gabriel Kreiman
TL;DR
本文研究白盒子对抗策略的效果,发现黑盒子对抗相对于对抗策略而言效果较差,训练白盒子对抗可以提高单 agent 环境的鲁棒性。
Abstract
adversarial examples
against AI systems pose both risks via malicious attacks and opportunities for improving
robustness
via adversarial training. In
→