Jun, 2024
VLind-Bench: Measuring Language Priors in Large Vision-Language Models
Kang-il Lee, Minbeom Kim, Seunghyun Yoon, Minsung Kim, Dongryeol Lee...
TL;DR
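Using the new benchmark VLind-Bench, this study evaluates and analyzes recent large vision-language models (LVLMs) and finds that nearly all of them rely excessively on language priors, which poses a major challenge for the field.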
Abstract
Large Vision-Language Models (LVLMs) have demonstrated outstanding performance across various multimodal tasks. However, they suffer from a problem known as