视觉注意力下的多对象识别

Dec, 2014

Multiple Object Recognition with Visual Attention

Jimmy Ba, Volodymyr Mnih, Koray Kavukcuoglu

TL;DR提出使用基于注意力模型的深度循环神经网络，加上强化学习训练，以识别并定位输入图像中的多个目标。通过在谷歌街景图像中转录房屋编号序列的挑战性任务上的测试，表明该模型精度高于现有最先进的卷积神经网络，且使用的参数和计算量较少。

Abstract

We present an attention-based model for recognizing multiple objects in images. The proposed model is a deep recurrent neural network trained with →