BriefGPT.xyz
Jul, 2023
通过在线回归进行选择性采样和模仿学习
Selective Sampling and Imitation Learning via Online Regression
HTML
PDF
Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu
TL;DR
本文提出了一种应用选择性抽样的交互式算法,可用于通过主动查询具有噪声的专家反馈实现模仿学习,并提供了关于后者的新算法,同时证明了该算法的后悔和查询复杂度在一定的理论范围内得到优化。
Abstract
We consider the problem of
imitation learning
(IL) by actively querying noisy expert for feedback. While
imitation learning
has been empirically successful, much of prior work assumes access to noiseless expert f
→