BriefGPT.xyz
Mar, 2017
An Analysis of Visual Question Answering Algorithms
Kushal Kafle, Christopher Kanan
TL;DR
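This paper analyzes existing visual question answering (VQA) algorithms and evaluates them on a new dataset. It proposes new evaluation schemes that compensate for over-represented question types, and it studies the strengths and weaknesses of different algorithms and the role of attention mechanisms.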
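To illustrate why an evaluation scheme might need to compensate for over-represented question types, here is a minimal sketch that compares plain accuracy with accuracy averaged per question type. The function names, the question-type labels, and the toy records are all hypothetical and chosen for illustration; this is one common way to do such compensation, not necessarily the exact metric defined in the paper.

```python
from collections import defaultdict


def overall_accuracy(records):
    """Fraction of all questions answered correctly, regardless of type."""
    return sum(1 for _, ok in records if ok) / len(records)


def mean_per_type_accuracy(records):
    """Average of the per-type accuracies, so each question type counts equally."""
    totals = defaultdict(int)
    correct = defaultdict(int)
    for qtype, ok in records:
        totals[qtype] += 1
        correct[qtype] += int(ok)
    per_type = [correct[t] / totals[t] for t in totals]
    return sum(per_type) / len(per_type)


# Hypothetical toy results: (question type, answered correctly?).
# "yes/no" questions dominate the set; "counting" questions are rare.
records = (
    [("yes/no", True)] * 8
    + [("counting", True), ("counting", False)]
)

print(overall_accuracy(records))        # dominated by the frequent type
print(mean_per_type_accuracy(records))  # each type weighted equally
```

In this toy set the frequent type inflates the plain accuracy, while the per-type average exposes the weaker performance on the rare type.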
Abstract
In visual question answering (VQA), an algorithm must answer text-based questions about images. While multiple datasets for VQA have been created since late 2014, they all have flaws in both their content and the way al…