BriefGPT.xyz
Jan, 2021
基于学习得到的优化对手的状态观测下鲁棒强化学习
Robust Reinforcement Learning on State Observations with Learned Optimal Adversary
HTML
PDF
Huan Zhang, Hongge Chen, Duane Boning, Cho-Jui Hsieh
TL;DR
研究了强化学习在面对敌对攻击(即使状态的扰动)时的鲁棒性,并提出了一种基于ATLA框架的方法来增强Agent的鲁棒性,通过训练online的对抗学习可以达到最优敌对攻击框架与提前学习历史数据等手段,从而提高强化学习在实验中的表现。
Abstract
We study the
robustness
of
reinforcement learning
(RL) with adversarially perturbed
state observations
, which aligns with the setting of m
→