量化如何影响多语言LLMs？

Jul, 2024

How Does Quantization Affect Multilingual LLMs?

Kelly Marchisio, Saurabh Dash, Hongyu Chen, Dennis Aumiller, Ahmet Üstün...

TL;DR量化、多语言LLMs的性能、语言、评估

Abstract

quantization techniques are widely used to improve inference speed and deployment of large language models. While a wide body of work examines the impact of quantized LLMs on English tasks, none have examined the effect of →