BriefGPT.xyz
Jul, 2023
MMBench: Is Your Multi-modal Model an All-around Player?
Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang...
TL;DR
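Proposes MMBench, a new multi-modal benchmark that comprehensively evaluates large vision-language models by combining a meticulously curated dataset with the CircularEval strategy and ChatGPT, with the aim of helping the research community better evaluate its models and encouraging future progress.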
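The summary names CircularEval without describing it. As a minimal sketch, assuming the strategy works as commonly described for MMBench (each multiple-choice question is asked once per circular shift of its options, and it counts as solved only if every pass is answered correctly), the idea could look like the following. The names `circular_shifts`, `circular_eval`, and the `predict` callback are hypothetical and not taken from the MMBench codebase; the ChatGPT-based step that maps free-form model output to an option letter is not modeled here.

```python
from typing import Callable, Iterator, List, Sequence, Tuple

LABELS = "ABCDEFGH"  # option letters; assumes at most 8 choices per question


def circular_shifts(
    choices: Sequence[str], answer_idx: int
) -> Iterator[Tuple[List[str], int]]:
    """Yield every circular shift of the choices with the shifted answer index."""
    items = list(choices)
    n = len(items)
    for shift in range(n):
        rotated = items[shift:] + items[:shift]
        # The option originally at answer_idx now sits at (answer_idx - shift) mod n.
        yield rotated, (answer_idx - shift) % n


def circular_eval(
    question: str,
    choices: Sequence[str],
    answer_idx: int,
    predict: Callable[[str, List[str]], str],
) -> bool:
    """Return True only if `predict` picks the right letter in every shifted pass.

    `predict` is a hypothetical stand-in for the model under test: it receives
    the question and the ordered options and returns a single option letter.
    """
    for rotated, new_idx in circular_shifts(choices, answer_idx):
        if predict(question, rotated) != LABELS[new_idx]:
            return False  # one wrong pass fails the whole question
    return True
```

Under these assumptions, a model that always answers "A" fails any question with more than one option, because the correct option occupies position A in only one of the shifted passes. That is the kind of position bias this style of evaluation is meant to expose.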
Abstract
Large vision-language models have recently achieved remarkable progress, exhibiting great perception and reasoning abilities concerning visual information. However, how to effectively evaluate these large vision-language...