Performance of Recent Large Language Models in Low-Resource Languages

July 2024

Performance of Recent Large Language Models for a Low-Resourced Language

Ravindu Jayakody, Gihan Dias

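TL;DR: This study examines how large language models perform in low-resource languages such as Sinhala, filling a research gap in this area. Evaluating four recent language models, it finds that Claude and GPT 4o perform well both when handling Sinhala directly and when working through English translation, significantly outperforming earlier versions. Llama and Mistral perform poorly but show potential for improvement with fine-tuning. The study offers new insights and practical model choices for low-resource language processing.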

Abstract

Large Language Models (LLMs) have shown significant advances in the past year. In addition to new versions of GPT and Llama, several other LLMs have been introduced recently. Some of these are open models available for download and modification. Although multilingual →