BriefGPT.xyz
May, 2024
SciFIBench:科学图表解读大型多模态模型基准测试
SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation
HTML
PDF
Jonathan Roberts, Kai Han, Neil Houlsby, Samuel Albanie
TL;DR
SciFIBench是一个科学图表解释的基准测试,评估了26个大型多模态模型在理解和解释图表方面的能力,并探究了模型在拓展问题集上的对齐和推理准确性。
Abstract
large multimodal models
(LMMs) have proven flexible and generalisable across many tasks and fields. Although they have strong potential to aid
scientific research
, their capabilities in this domain are not well c
→