BriefGPT.xyz
Nov, 2023
Dissecting the Runtime Performance of the Training, Fine-tuning, and Inference of Large Language Models
Longteng Zhang, Xiang Liu, Zeyu Li, Xinglin Pan, Peijie Dong...
TL;DR
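Through detailed analysis and benchmarking of the pre-training, fine-tuning, and runtime performance of large language models, this study aims to give users and researchers an understanding of configuration choices and of the different methods, frameworks, and hardware platforms available for optimizing performance.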
Abstract
Large language models (LLMs) have seen great advances in both academia and industry, and their popularity has resulted in numerous open-source frameworks and techniques for accelerating LLM pre-training, ...