Shenbin Qian, Archchana Sindhujan, Minnie Kabra, Diptesh Kanojia, Constantin Orăsan...
TL;DR本研究探讨了大型语言模型(LLMs)在机器翻译(MT)评估中所需的翻译信息,包括来源、参考、翻译错误和注释指南。研究结果表明,参考翻译对LLM的评估至关重要,同时发现Chain of Thought(CoT)提示对较大模型的影响更为显著,为资源受限的LLM评估提供了全面分析。
Abstract
Leveraging Large Language Models (LLMs) for various natural language processing tasks has led to superlative claims about their performance. For the Evaluation of →