BriefGPT.xyz
Dec, 2021
CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark
Yuan Yao, Qingxiu Dong, Jian Guan, Boxi Cao, Zhengyan Zhang...
TL;DR
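Proposes CUGE, a comprehensive and systematic evaluation benchmark for assessing general-purpose language intelligence in natural language processing. Evaluation results on pre-trained language models show that there is still room for improvement.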
Abstract
Realizing general-purpose language intelligence has been a longstanding goal for natural language processing, where standard evaluation benchmark...