BriefGPT.xyz
Jun, 2022
视频预训练(VPT):通过观看未标记的在线视频学习行为
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
HTML
PDF
Bowen Baker, Ilge Akkaya, Peter Zhokhov, Joost Huizinga, Jie Tang...
TL;DR
该研究探索了如何利用半监督式模仿学习的方法,在游戏领域中通过预训练行为先验模型来实现强化学习,从而达到人类甚至更高的行为水平。
Abstract
pretraining
on noisy, internet-scale datasets has been heavily studied as a technique for training models with broad, general capabilities for text, images, and other modalities. However, for many
sequential decision do
→