BriefGPT.xyz
Feb, 2024
大型语言模型:一份调查报告
Large Language Models: A Survey
HTML
PDF
Shervin Minaee, Tomas Mikolov, Narjes Nikzad, Meysam Chenaghlu, Richard Socher...
TL;DR
对大型语言模型(LLMs)进行了综述,包括三个流行的LLM系列(GPT,LLaMA,PaLM)的特点、贡献和局限性,同时讨论了构建和增强LLMs的技术、为LLM训练、微调和评估准备的常用数据集以及常用的LLM评估指标,最后讨论了未来的挑战和研究方向。
Abstract
large language models
(
llms
) have drawn a lot of attention due to their strong performance on a wide range of natural language tasks, since the release of ChatGPT in November 2022.
→