BriefGPT.xyz
Jun, 2023
Vid2Act:激活离线视觉强化学习视频
Vid2Act: Activate Offline Videos for Visual RL
HTML
PDF
Pan Minting, Zheng Yitao, Wang Yunbo, Yang Xiaokang
TL;DR
Vid2Act是一种基于模型的强化学习方法,其使用世界模型作为行为学习的模拟器并使用它们来衡量动力学表示转移和策略转移的域相关性,以将有价值的动作条件动态和潜在有用的行动演示从离线到在线环境进行转移。
Abstract
pretraining rl models
on
offline video datasets
is a promising way to improve their training efficiency in online tasks, but challenging due to the inherent mismatch in tasks, dynamics, and behaviors across domai
→