BriefGPT.xyz
Oct, 2023
大型视觉语言模型中的对象幻觉分析与缓解
Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
HTML
PDF
Yiyang Zhou, Chenhang Cui, Jaehong Yoon, Linjun Zhang, Zhun Deng...
TL;DR
LVLM Hallucination Revisor (LURE)是一种简单而强大的算法,通过重建较少产生幻觉的描述来修正LVLMs中的物体幻觉问题,从而提高视觉总结和推理等视觉语言任务的性能。
Abstract
large vision-language models
(LVLMs) have shown remarkable abilities in understanding visual information with human languages. However, LVLMs still suffer from
object hallucination
, which is the problem of genera
→