BriefGPT.xyz
Dec, 2023
部分动力学知识的高效强化学习
Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge
HTML
PDF
Meshal Alharbi, Mardavij Roozbehani, Munther Dahleh
TL;DR
本文研究在线强化学习的样本复杂性问题,并考虑了有关系统动态的部分知识,提出了一种基于Q-learning的算法,能够在具有有限Markov决策过程的系统中实现近似最优策略。
Abstract
The problem of
sample complexity
of
online reinforcement learning
is often studied in the literature without taking into account any partial knowledge about the
→