BriefGPT.xyz
Nov, 2022
基于视觉语言模型的指令增强机器人技能习得
Robotic Skill Acquisition via Instruction Augmentation with Vision-Language Models
HTML
PDF
Ted Xiao, Harris Chan, Pierre Sermanet, Ayzaan Wahid, Anthony Brohan...
TL;DR
本论文介绍了一种名为DIAL的方法,利用半监督的语言标签,结合CLIP的语义理解,将知识传播到大型未标记的数据集中,并在增强的数据集上训练语言条件下的策略,从而使模仿学习策略获得新的能力并推广到原始数据集中未见过的60个新指令。
Abstract
In recent years, much progress has been made in learning
robotic manipulation
policies that follow
natural language instructions
. Such methods typically learn from corpora of robot-language data that was either c
→