BriefGPT.xyz
Nov, 2023
Invariant Causal Imitation Learning for Generalizable Policies
Ioana Bica, Daniel Jarrett, Mihaela van der Schaar
TL;DR
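Learns an imitation policy from behavior demonstrations collected in multiple environments. By learning a feature representation that is invariant across domains, it builds an imitation policy that matches expert behavior and generalizes to unseen environments.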
Abstract
Consider learning an imitation policy on the basis of demonstrated behavior from multiple environments, with an eye towards deployment in an unseen environment. Since the observable features from each setting may …