BriefGPT.xyz
Dec, 2021
通过生成模型实现鲁棒强化学习的样本复杂性
Sample Complexity of Robust Reinforcement Learning with a Generative Model
HTML
PDF
Kishan Panaganti, Dileep Kalathil
TL;DR
该研究提出了一种基于模型的强化学习算法,用于学习在标准和不确定的模型下最优的稳健控制策略,并考虑了不同形式的不确定性集合
Abstract
The
robust markov decision process
(RMDP) framework focuses on designing
control policies
that are robust against the
parameter uncertainties
→