BriefGPT.xyz
Jul, 2023
A Comprehensive Overview of Large Language Models
Humza Naveed, Asad Ullah Khan, Shi Qiu, Muhammad Saqib, Saeed Anwar...
TL;DR
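This survey comprehensively analyzes the architectures of large language models and their categorization, training strategies, training datasets, and performance evaluation. It also discusses future research directions and closes by summarizing the key findings of LLM research along with the most important architectures and training strategies.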
Abstract
Large language models (LLMs) have shown excellent generalization capabilities that have led to the development of numerous models. These models propose various new architectures, tweaking existing...