BriefGPT.xyz
Jan, 2024
Offline Imitation Learning by Controlling the Effective Planning Horizon
Hee-Jun Ahn, Seong-Woong Shim, Byung-Jun Lee
TL;DR
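By controlling the effective planning horizon, we correct the approximation-error problem in common offline imitation learning algorithms, thereby improving their performance.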
Abstract
In offline imitation learning (IL), we generally assume only a handful of expert trajectories and a supplementary offline dataset from sub…