BriefGPT.xyz
May, 2023
协作世界模型:一种在线-离线迁移强化学习方法
Collaborative World Models: An Online-Offline Transfer RL Approach
HTML
PDF
Qi Wang, Junming Yang, Yunbo Wang, Xin Jin, Wenjun Zeng...
TL;DR
该研究提出了一种称为协作世界模型(CoWorld)的转移学习方法,在离线数据集下为视觉强化学习模型提高性能,并成功缓解了价值函数的过高估计问题。
Abstract
Training visual reinforcement learning (RL) models in offline datasets is challenging due to overfitting issues in
representation learning
and overestimation problems in
value function
. In this paper, we propose
→