主动一次性学习

Feb, 2017

Active One-shot Learning

Mark Woodward, Chelsea Finn

TL;DR使用强化学习和单样本学习相结合的方法，使得模型能够在分类过程中决定哪些样本需要标注，我们提出了一种基于递归神经网络的动作值函数来实现，通过选择奖励函数，该模型能够在减少样本标注需求的同时达到更高的准确率。

Abstract

Recent advances in one-shot learning have produced models that can learn from a handful of labeled examples, for passive classification and regression tasks. This paper combines reinforcement learning with