BriefGPT.xyz
Jun, 2018
学习视觉问答的答案嵌入
Learning Answer Embeddings for Visual Question Answering
HTML
PDF
Hexiang Hu, Wei-Lun Chao, Fei Sha
TL;DR
该研究提出了一种新的概率模型,用于视觉问答中的多项选择,将嵌入视觉、问答和回答,并考虑到回答之间的语义关系,从而提高了对新问题的表现。
Abstract
We propose a novel
probabilistic model
for
visual question answering
(Visual QA). The key idea is to infer two sets of
embeddings
: one for
→