BriefGPT.xyz
Jun, 2024
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning
Hao Bai, Yifei Zhou, Mert Cemri, Jiayi Pan, Alane Suhr...
TL;DR
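This paper proposes DigiRL, a new autonomous reinforcement learning approach that trains vision language models with decision-making ability directly in open-ended environments, and it achieves new state-of-the-art results on controlling a variety of devices.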
Abstract
Training corpuses for vision language models (VLMs) typically lack sufficient amounts of decision-centric data. This renders off-the-shelf VLMs sub-optimal for decision-making tasks such as ...