带有噪声引导的主动模仿学习

May, 2020

Active Imitation Learning with Noisy Guidance

Kianté Brantley, Amr Sharaf, Hal Daumé III

TL;DRLEAQI算法利用差异分类器在序列标注任务中替代了昂贵、低效的查询过程，实现了更好的查询效果和准确度。

Abstract

imitation learning algorithms provide state-of-the-art results on many structured prediction tasks by learning near-optimal search policies. Such algorithms assume training-time access to an expert that can provide the optimal action at any queried state; unfortunately, the number of s