BriefGPT.xyz
Dec, 2023
Good Questions Help Zero-Shot Image Reasoning
Kaiwen Yang, Tao Shen, Xinmei Tian, Xiubo Geng, Chongyang Tao...
TL;DR
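Introducing Question-Driven Visual Exploration (QVix) strengthens the exploratory capability of large vision-language models (LVLMs) in zero-shot reasoning tasks, improving the accuracy and depth of their reasoning.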
Abstract
Aligning the recent large language models (LLMs) with computer vision models leads to large vision-language models (LVLMs), which have paved the way for ...