BriefGPT.xyz
Mar, 2018
VQA-E:为视觉问题解答进行解释、阐释和增强
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions
HTML
PDF
Qing Li, Qingyi Tao, Shafiq Joty, Jianfei Cai, Jiebo Luo
TL;DR
提出了VQA-E任务,要求计算机模型在预测答案的同时生成一个解释。通过多任务学习框架, VQA-E数据集从VQA v2数据集自动导出,用户研究表明,我们的方法可以生成有洞察力的文本句子来证明答案,并提高了答案预测的性能。
Abstract
Most existing works in
visual question answering
(VQA) are dedicated to improving the accuracy of predicted answers, while disregarding the
explanations
. We argue that the explanation for an answer is of the same
→