BriefGPT.xyz
Jul, 2021
X-GGM: Graph Generative Modeling for Out-of-Distribution Generalization in Visual Question Answering
Jingjing Jiang, Ziyi Liu, Yifan Liu, Zhixiong Nan, Nanning Zheng
TL;DR
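This paper proposes a VQA model based on graph generative modeling. Using attribute-object pairs as nodes, the model progressively generates a relation matrix and node representations to address out-of-distribution (OOD) generalization in VQA, and it achieves state-of-the-art performance on two standard VQA OOD benchmarks.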
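As a rough illustration of the two quantities named in the summary (a relation matrix over attribute-object nodes, and node representations derived from it), here is a minimal sketch. It is not the paper's implementation: the example nodes, the random embeddings, the dot-product scoring, the softmax normalization, and the function names `generate_relation_matrix` and `generate_node_representations` are all assumptions chosen only to make the data structures concrete.

```python
import math
import random

# Hypothetical graph nodes: each node is an (attribute, object) pair.
nodes = [("red", "apple"), ("wooden", "table"), ("green", "leaf")]


def random_embedding(dim, rng):
    """Stand-in for a learned node feature vector (assumption)."""
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def generate_relation_matrix(features):
    """Score every node pair by dot product, then normalize each row."""
    matrix = []
    for source in features:
        scores = [sum(a * b for a, b in zip(source, target)) for target in features]
        matrix.append(softmax(scores))
    return matrix


def generate_node_representations(relation, features):
    """Each new node vector is the relation-weighted sum of all node vectors."""
    dim = len(features[0])
    updated = []
    for weights in relation:
        vector = [
            sum(w * feat[d] for w, feat in zip(weights, features))
            for d in range(dim)
        ]
        updated.append(vector)
    return updated


rng = random.Random(0)
features = [random_embedding(4, rng) for _ in nodes]
relation = generate_relation_matrix(features)
updated = generate_node_representations(relation, features)
```

Here `relation` is a 3 x 3 matrix whose rows each sum to 1, and `updated` holds one 4-dimensional vector per node. Repeating the two calls with `updated` in place of `features` would give a step-by-step refinement loop, which is one plausible reading of "progressively" in the summary.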
Abstract
Encouraging progress has been made towards visual question answering (VQA) in recent years, but it is still challenging to enable VQA models to adaptively generalize to out-of-distribution (OOD) samples. Intuitiv