BriefGPT.xyz
Oct, 2023
ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks
Zejun Li, Ye Wang, Mengfei Du, Qingwen Liu, Binhao Wu...
TL;DR
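Using the ReForm-Eval benchmark, we conduct a comprehensive quantitative evaluation of the various capabilities of LVLMs, identify and analyze the strengths and weaknesses of existing LVLMs, and determine the potential factors that influence them.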
Abstract
Recent years have witnessed remarkable progress in the development of large vision-language models (LVLMs). Benefiting from the strong language backbones and efficient