Analyzing Context Utilization of Large Language Models in Document-Level Translation

October 2024

Analyzing Context Utilization of LLMs in Document-Level Translation

Wafaa Mohammed, Vlad Niculae

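TL;DR: This study addresses the insufficient use of context by large language models in document-level translation. By analyzing how robust the models are to perturbed and randomized document context, it proposes a fine-tuning strategy targeted at the context-relevant parts of the text to make translation more reliable. The study finds that improved document-level translation performance does not necessarily bring better pronoun translation, which highlights the need for further work in this area.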

Abstract

Large Language Models (LLM) are increasingly strong contenders in machine translation. We study Document-Level Translation, where some words cannot be translated without context from outside the sentence. We inve