BriefGPT.xyz
Dec, 2024
Qwen2.5 Technical Report
Qwen: An Yang, Baosong Yang, Beichen Zhang...
TL;DR
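This study addresses the shortcomings of large language models (LLMs) in meeting diverse needs and proposes the Qwen2.5 series of models. By expanding the pre-training dataset and applying multi-stage reinforcement learning, the paper significantly improves model performance on long-text generation, structured data analysis, and instruction following, and the models achieve excellent results across multiple benchmarks.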
Abstract
In this report, we introduce Qwen2.5, a comprehensive series of large language models (LLMs) designed to meet diverse needs. Compared to previous iterations, Qwen 2.5 has been significantly improved during both the pre-...