冗长语言模型输出的影响：翻译评估的案例研究

Oct, 2024

On the Implications of Verbose LLM Outputs: A Case Study in Translation Evaluation

Eleftheria Briakou, Zhongtao Liu, Colin Cherry, Markus Freitag

TL;DR本研究探讨了冗长语言模型翻译对评估的影响，指出了在机器翻译中冗长输出的普遍存在及其主要触发因素，如安全性、版权问题与输入查询上下文不足。研究发现，在评估中忽视这一现象会不公平地惩罚输出更冗长的语言模型，从而强调了未来评估准确性的重要性。

Abstract

This paper investigates the impact of verbose LLM translations on Evaluation. We first demonstrate the prevalence of this behavior across several