引入cosmosGPT：针对土耳其语言模型的单语言训练

Apr, 2024

引入cosmosGPT：针对土耳其语言模型的单语言训练

Introducing cosmosGPT: Monolingual Training for Turkish Language Models

H. Toprak Kesgin, M. Kaan Yuce, Eren Dogan, M. Egemen Uzun, Atahan Uz...

TL;DR通过用纯土耳其语语料库训练建立的cosmosGPT模型和适应土耳其语的语言模型的全面比较，研究结果显示，尽管相较于其他模型，我们用单语料库建立的语言模型规模较小约10倍，但其表现仍然有可观的性能。

Abstract

The number of open source language models that can produce Turkish is increasing day by day, as in other languages. In order to create the basic versions of such models, the training of multilingual models is usually continued with →