BriefGPT.xyz
Sep, 2024
低比特大型语言模型的调研:基础、系统与算法
A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms
HTML
PDF
Ruihao Gong, Yifu Ding, Zining Wang, Chengtao Lv, Xingyu Zheng...
TL;DR
本文针对大型语言模型在实际应用中面临的高内存和计算需求问题,提出了低比特量化作为解决方案。通过系统地总结低比特量化的方法和实现,提供了基础概念、系统框架及高效训练与推理技术的深入分析,指出未来低比特大型语言模型发展的潜力和趋势。
Abstract
Large Language Models
(LLMs) have achieved remarkable advancements in
Natural Language Processing
, showcasing exceptional performance across various tasks. However, the expensive memory and computational requirem
→