BriefGPT.xyz
Jul, 2023
具有概率策略执行不确定性的高效动作稳健强化学习
Efficient Action Robust Reinforcement Learning with Probabilistic Policy Execution Uncertainty
HTML
PDF
Guanin Liu, Zhihan Zhou, Han Liu, Lifeng Lai
TL;DR
本文探讨了具有概率性策略执行不确定性的抗干扰强化学习问题,并提出了 ARRLC 算法,该算法具有极小化最坏情况下收益损失和样本复杂性的性质,并在实验中验证了其在存在干扰情况下的稳健性。
Abstract
robust reinforcement learning
(RL) aims to find a policy that optimizes the worst-case performance in the face of uncertainties. In this paper, we focus on action robust RL with the
probabilistic policy execution uncert
→