BriefGPT.xyz
Jul, 2022
鉴别器指导的基于模型的离线模仿学习
Discriminator-Guided Model-Based Offline Imitation Learning
HTML
PDF
Wenjia Zhang, Haoran Xu, Haoyi Niu, Peng Cheng, Ming Li...
TL;DR
该论文提出了一种基于鉴别器指导的模型辅助离线仿真学习框架,该框架采用协作对抗学习策略,能够显著提高在小数据集下的性能和鲁棒性。
Abstract
offline imitation learning
(IL) is a powerful method to solve decision-making problems from
expert demonstrations
without reward labels. Existing offline IL methods suffer from severe performance degeneration und
→