BriefGPT.xyz
May, 2021
Dynaboard: 一款全面的下一代基准评估即服务平台
Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking
HTML
PDF
Zhiyi Ma, Kawin Ethayarajh, Tristan Thrush, Somya Jain, Ledell Wu...
TL;DR
Dynaboard 是一个评估即服务框架,集成于 Dynabench 平台,评估 NLP 模型的质量和性能,并使用基于用户定制的 Dynascore 统计综合评估指标,帮助用户更好地评估模型质量。
Abstract
We introduce
dynaboard
, an
evaluation-as-a-service
framework for hosting benchmarks and conducting holistic model comparison, integrated with the Dynabench platform. Our platform evaluates
→