BriefGPT.xyz
Jul, 2021
将技能与概念分离以进行新颖视觉问答
Separating Skills and Concepts for Novel Visual Question Answering
HTML
PDF
Spencer Whitehead, Hui Wu, Heng Ji, Rogerio Feris, Kate Saenko
TL;DR
提出了一种基于视觉任务的方法,将 VQA 问题分解为技能和概念,并通过对对应的概念表示和技能编码的解耦实现模型内部的有效组合,从而提高了处理新组合任务的能力。
Abstract
generalization
to out-of-distribution data has been a problem for
visual question answering
(VQA) models. To measure
generalization
to nov
→