BriefGPT.xyz
Jun, 2024
小型语言模型中实现稀疏激活
Achieving Sparse Activation in Small Language Models
HTML
PDF
Jifeng Song, Kai Huang, Xiangyu Yin, Boyuan Yang, Wei Gao
TL;DR
我们在小型语言模型(SLMs)中实现了稀疏激活,并通过新的归因测量指标以达到精确的稀疏激活,实验证明我们的方法可以在只损失<5%的模型准确性的情况下实现80%的稀疏化比率,可与大型语言模型(LLMs)中实现的稀疏激活相媲美。
Abstract
sparse activation
, which selectively activates only an input-dependent set of
neurons
in inference, is a useful technique to reduce the computing cost of Large
→