BriefGPT.xyz
Jun, 2022
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
Cong Lu, Philip J. Ball, Tim G. J. Rudner, Jack Parker-Holder, Michael A. Osborne...
TL;DR
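This paper aims to establish visual baselines for continuous control: it builds simple baselines for offline reinforcement learning from visual observations, rigorously evaluates algorithms across datasets, and analyzes the particular requirements of the offline visual setting.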
Abstract
Offline reinforcement learning has shown great promise in leveraging large pre-collected datasets for policy learning, allowing agents to forgo often-expensive online data collection. However, to date, offline reinforce…