BriefGPT.xyz
Mar, 2022
离线强化学习综述:分类、评估与开放性问题
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems
HTML
PDF
Rafael Figueiredo Prudencio, Marcos R. O. A. Maximo, Esther Luna Colombini
TL;DR
本论文提出一个在线学习和离线学习技术的归一化分类法,总结了离线RL领域的最新算法突破和现有基准的特性和不足,并提供了对未来研究方向的展望。
Abstract
With the widespread adoption of
deep learning
,
reinforcement learning
(RL) has experienced a dramatic increase in popularity, scaling to previously intractable problems, such as playing complex games from pixel o
→