BriefGPT.xyz
Jan, 2024
MERA: 俄语中的综合语言水平评估
MERA: A Comprehensive LLM Evaluation in Russian
HTML
PDF
Alena Fenogenova, Artem Chervyakov, Nikita Martynov, Anastasia Kozlova, Maria Tikhonova...
TL;DR
通过引入一个新的用于评估基础模型的多模态俄语架构(MERA),本文介绍了一种在零点和少点固定指令设置下评估基础模型和语言模型的方法论,该方法论可以扩展到其他模态,在评估开放式语言模型的基线时发现其仍远落后于人类水平。
Abstract
Over the past few years, one of the most notable advancements in AI research has been in
foundation models
(FMs), headlined by the rise of
language models
(LMs). As the models' size increases, LMs demonstrate enh
→