BriefGPT.xyz
Jul, 2019
On conducting better validation studies of automatic metrics in natural language generation evaluation
Johnny Tian-Zheng Wei
TL;DR
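This paper examines how automatic metrics are used and validated in natural language generation evaluation. It proposes best practices for validation studies, analyzes the WMT'17 metrics shared task, and highlights directions for future work.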
Abstract
Natural language generation (NLG) has received increasing attention, which has highlighted evaluation as a central methodological concern. Since human evaluations for these systems are costly, automatic metrics ...