Sep, 2021
Implicit Behavioral Cloning
Pete Florence, Corey Lynch, Andy Zeng, Oscar Ramirez, Ayzaan Wahid...
TL;DR
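In robot policy learning, supervised policy learning with an implicit model generally performs better than with an explicit one. Such policies need no reward information and can learn complex behaviors from human demonstrations, including on tasks with high combinatorial complexity and millimeter-level precision requirements.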
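To make the explicit/implicit distinction concrete, here is a minimal sketch in plain Python. An explicit policy maps an observation straight to an action, while an implicit policy picks the action that minimizes an energy function over observation-action pairs. Everything named here is an assumption for illustration: the `energy` function is hand-written rather than learned, the scalar observation and action are toy stand-ins, and the grid search stands in for whatever optimizer a real system would use.

```python
import math


def explicit_policy(obs: float) -> float:
    """Explicit model: the action is a direct function of the observation."""
    return math.tanh(obs)


def energy(obs: float, action: float) -> float:
    """Hypothetical hand-written energy, lowest where action == tanh(obs).

    In a learned implicit policy this would be a trained network instead.
    """
    return (action - math.tanh(obs)) ** 2


def implicit_policy(obs: float, low: float = -1.0, high: float = 1.0,
                    steps: int = 2001) -> float:
    """Implicit model: return the candidate action with the lowest energy."""
    # Evenly spaced candidate actions in [low, high]; spacing is 0.001 here.
    candidates = [low + (high - low) * i / (steps - 1) for i in range(steps)]
    return min(candidates, key=lambda a: energy(obs, a))


if __name__ == "__main__":
    obs = 0.3
    print("explicit action:", explicit_policy(obs))
    print("implicit action:", implicit_policy(obs))
```

In this toy setup both policies agree up to the grid resolution, because the energy was built around the explicit function. The practical difference shows up when the energy has several minima or sharp discontinuities, which a direct regression output cannot represent as easily.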
Abstract
We find that across a wide range of robot policy learning scenarios, treating supervised policy learning with an implicit model generally performs better, on average, than commonly used explicit models. We presen