May, 2023
An Empirical Study on the Language Modal in Visual Question Answering
Daowan Peng, Wei Wei, Xian-Ling Mao, Yuanyuan Fu, Dangyang Chen
TL;DR
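Through a series of experiments, this paper investigates how the language modality affects visual question answering models on data outside their training domain. It proposes simple methods that reduce the models' reliance on language priors and improve performance on out-of-distribution test sets.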
Abstract
Generalization beyond in-domain experience to out-of-distribution data is of paramount significance in the AI domain. Of late, state-of-the-art visual question answering (VQA) models have shown impressive perform