基于VQA的视觉基础方法的负面案例分析

Apr, 2020

基于VQA的视觉基础方法的负面案例分析

A negative case analysis of visual grounding methods for VQA

Robik Shrestha, Kushal Kafle, Christopher Kanan

TL;DR为解决当前 VQA 方法依赖于数据集偏见和统计相关性的问题，提出了利用视觉提示进行VQA模型性能优化的方法，但实现的改进并不是由于视觉提示的影响，而是正则化效应，为此提出了一种不需要外部数据注释的简单正则化方案。

Abstract

Existing visual question answering (VQA) methods tend to exploit dataset biases and spurious statistical correlations, instead of producing right answers for the right reasons. To address this issue, recent bias mitigat