May, 2024
Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian
Aleksandr Nikolich, Konstantin Korolev, Artem Shelmanov
TL;DR
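Text generation in languages other than English faces challenges such as poor generation quality and reduced computational performance. To address them, this work introduces Vikhr, an open-source instruction-tuned large language model designed specifically for Russian. Using an adapted tokenizer vocabulary, continued pre-training, and instruction tuning, it improves model performance and computational efficiency and achieves notable results on Russian benchmarks.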
Abstract
There has been a surge in the development of various large language models (LLMs). However, text generation for languages other than English often faces significant challenges, including poor generation quality a