BriefGPT.xyz
Oct, 2021
探究预训练语言模型对对话评价的影响
Investigating the Impact of Pre-trained Language Models on Dialog Evaluation
HTML
PDF
Chen Zhang, Luis Fernando D'Haro, Yiming Chen, Thomas Friedrichs, Haizhou Li
TL;DR
本研究分析了8种不同的预训练语言模型对三个典型自动对话评估度量标准在三个不同对话评估基准上的表现,包括预训练目标,对话评估标准,模型尺寸和跨数据集的稳健性,为首次对不同预训练语言模型对自动对话性能影响的全面评估。
Abstract
Recently, there is a surge of interest in applying
pre-trained language models
(Pr-LM) in
automatic open-domain dialog evaluation
. Pr-LMs offer a promising direction for addressing the multi-domain evaluation cha
→