BriefGPT.xyz
Jun, 2018
学习条件化图结构以进行可解释的视觉问答
Learning Conditioned Graph Structures for Interpretable Visual Question Answering
HTML
PDF
Will Norcliffe-Brown, Efstathios Vafeais, Sarah Parisot
TL;DR
本论文提出了一种基于图形的视觉问答新方法,该方法结合了用于学习问题特定图形表示的图形学习器模块和最近的图形卷积概念,旨在学习能够捕捉问题特定交互的图像表示。该方法在VQA v2数据集上获得了66.18%的准确率,证明了其可解释性。
Abstract
visual question answering
is a challenging problem requiring a combination of concepts from
computer vision
and
natural language processing
→