BriefGPT.xyz
Oct, 2023
探索零样本视觉问答的问题分解
Exploring Question Decomposition for Zero-Shot VQA
HTML
PDF
Zaid Khan, Vijay Kumar BG, Samuel Schulter, Manmohan Chandraker, Yun Fu
TL;DR
通过研究和应用视觉-语言模型,本文提出了问题分解策略和模型驱动的选择性分解方法,以提高视觉问答任务的准确性和性能。
Abstract
visual question answering
(VQA) has traditionally been treated as a single-step task where each question receives the same amount of effort, unlike natural human question-answering strategies. We explore a
question deco
→