Nov, 2020
Offline Reinforcement Learning Hands-On
Louis Monier, Jakub Kmec, Alexandre Laterre, Thomas Pierrot, Valentin Courgeau...
TL;DR
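This study focuses on offline reinforcement learning, examining how the properties of a dataset relate to the success of offline methods. Experiments show that diversity and high-return examples in the data are crucial to the success of offline RL, and that behavioral cloning remains a strong competitor.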
Abstract
Offline reinforcement learning (RL) aims to turn large datasets into powerful decision-making engines without any online interactions with...