使用卷积神经网络从图像中学习答案

Jun, 2015

使用卷积神经网络从图像中学习答案

Learning to Answer Questions From Image using Convolutional Neural Network

Lin Ma, Zhengdong Lu, Hang Li

TL;DR本文提出使用卷积神经网络 (CNN) 解决图像问答 (QA) 问题，通过三个 CNN 模型来提升图像和问题共同表示的分类能力。经过 DAQUAR 和 COCO-QA 两个基准测试集的测试，本文的模型表现显著优于现有的最优解。

Abstract

In this paper, we propose to employ the convolutional neural network (CNN) for learning to answer questions from the image. Our proposed CNN provides an end-to-end framework for learning not only the image representation, the composition model for question, but also the inter-modal int