October 2016
Visual Question Answering: Datasets, Algorithms, and Future Challenges
Kushal Kafle, Christopher Kanan
TL;DR
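This paper reviews recent research on visual question answering (VQA) in computer vision and natural language processing, covering the problem formulation, datasets, algorithms, and evaluation metrics. It examines in depth the limitations of current datasets for training and evaluating VQA algorithms, comprehensively surveys existing VQA algorithms, and closes by discussing possible future directions for VQA and image understanding research.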
Abstract
Visual question answering (VQA) is a recent problem in computer vision and natural language processing that has garnered a large amount of …