Apr, 2017
Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question Answering
Vahid Kazemi, Ali Elqursh
TL;DR
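This paper introduces a new baseline model for the visual question answering task. Given an image and a natural-language question, the model produces accurate answers based on the image content, and it achieves state-of-the-art results on both the unbalanced and balanced VQA benchmarks.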
Abstract
This paper presents a new baseline for the visual question answering task. Given an image and a question in natural language, our model produces accurate answers according to the content of the image. Our …