视觉问答中的场景图推理

Jul, 2020

Scene Graph Reasoning for Visual Question Answering

Marcel Hildebrandt, Hang Li, Rajat Koner, Volker Tresp, Stephan Günnemann

TL;DR我们提出了一种基于场景图和强化学习的方法来解决视觉问答任务，实验结果表明该方法在GQA数据集上已达到接近人类水平的效果。

Abstract

visual question answering is concerned with answering free-form questions about an image. Since it requires a deep linguistic understanding of the question and the ability to associate it with various objects that are present in the image, it is an ambitious task and requires technique