BriefGPT.xyz
Jan, 2025
Effective and Efficient Mixed Precision Quantization of Speech Foundation Models
Haoning Xu, Zhaoqing Li, Zengrui Jin, Huimeng Wang, Youjun Chen...
TL;DR
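This paper addresses the efficiency of quantizing speech foundation models. It proposes a novel mixed-precision quantization method that integrates mixed-precision learning and quantized model parameter estimation into a single model compression stage. The reported results show that the method significantly improves the compression ratio and reduces compression time without increasing the word error rate, which suggests strong potential for practical deployment.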
Abstract
This paper presents a novel mixed-precision quantization approach for speech foundation models that tightly integrates mixed-precision learning and quantized model parameter estimation into one single model compression stage.
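To make the term "mixed-precision quantization" concrete, the following is a minimal sketch of the general idea only, not the paper's method: different layers are stored at different bit-widths, and the compression ratio follows from the bit counts. The layer names, the hand-picked bit-widths, and the helpers `quantize_uniform` and `compression_ratio` are all hypothetical. The paper learns the precision settings jointly with the quantized parameters in one stage, which this sketch does not attempt.

```python
import random


def quantize_uniform(weights, bits):
    """Symmetric uniform quantization to `bits` bits (returned dequantized).

    Illustrative helper, not taken from the paper. Requires bits >= 2.
    """
    if bits < 2:
        raise ValueError("bits must be at least 2 for a symmetric signed grid")
    levels = 2 ** (bits - 1) - 1  # 127 for 8 bits, 7 for 4 bits
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0.0:
        return list(weights)
    scale = max_abs / levels
    # Round each weight to the nearest grid point k * scale, k in [-levels, levels].
    return [max(-levels, min(levels, round(w / scale))) * scale for w in weights]


def compression_ratio(layer_sizes, layer_bits, full_bits=32):
    """Ratio of full-precision storage to mixed-precision storage, in bits."""
    full = sum(n * full_bits for n in layer_sizes.values())
    quant = sum(n * layer_bits[name] for name, n in layer_sizes.items())
    return full / quant


random.seed(0)

# Two toy "layers" with random weights (assumption: stand-ins for real model layers).
layers = {
    "encoder.0": [random.gauss(0.0, 1.0) for _ in range(1000)],
    "encoder.1": [random.gauss(0.0, 1.0) for _ in range(1000)],
}

# Hand-picked precisions (assumption); a learned scheme would choose these per layer.
bit_widths = {"encoder.0": 8, "encoder.1": 4}

quantized = {name: quantize_uniform(w, bit_widths[name]) for name, w in layers.items()}
sizes = {name: len(w) for name, w in layers.items()}
ratio = compression_ratio(sizes, bit_widths)
print(f"compression ratio vs. 32-bit: {ratio:.2f}x")
```

In this toy setting the layer kept at 8 bits retains a finer grid than the layer stored at 4 bits, which is the trade-off a mixed-precision scheme tries to balance against word error rate.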