BriefGPT.xyz
May, 2024
VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap
Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Oriol Nieto...
TL;DR
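This work presents an in-depth analysis of hallucination in large vision-language models (LVLMs), reports several new insights, and proposes a simple, robust, training-free method (VDGD) to mitigate hallucination. Experimental results show that VDGD significantly outperforms other baseline methods at reducing hallucination.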
Abstract
Recent interest in large vision-language models (LVLMs) for practical applications is moderated by the significant challenge of hallucination or the inconsistency between the factual information and the generated...