BriefGPT.xyz
Dec, 2023
基于检测的视觉问答中间监督
Detection-based Intermediate Supervision for Visual Question Answering
HTML
PDF
Yuhang Liu, Daowan Peng, Wei Wei, Yuanyuan Fu, Wenfeng Xie...
TL;DR
采用检测为基础的中间监督方法(DIS)来提供更全面和准确的中间监督,从而提升了回答推理性问题的性能,并通过考虑中间结果来增强了回答复合问题及其子问题的一致性。
Abstract
Recently,
neural module networks
(NMNs) have yielded ongoing success in answering compositional
visual questions
, especially those involving multi-hop visual and logical reasoning. NMNs decompose the complex ques
→