BriefGPT.xyz
Jun, 2024
BlockPruner: Fine-grained Pruning for Large Language Models
Longguang Zhong, Fanqi Wan, Ruijun Chen, Xiaojun Quan, Liangzhi Li
TL;DR
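We propose BlockPruner, a novel training-free structured pruning method that achieves finer-grained pruning by locating redundancy in multi-head attention (MHA) and multi-layer perceptron (MLP) blocks. Experiments show that, compared with existing methods, BlockPruner prunes more precisely and effectively across a variety of downstream tasks.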
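To make the idea of block-level pruning concrete, here is a minimal toy sketch in Python. It is not the authors' implementation: the summary above does not state how redundancy is measured or in what order blocks are removed, so the greedy search, the mean-squared-error score against the unpruned output, and the names `forward`, `greedy_block_prune`, and `toy_blocks` are all assumptions made for illustration. Each "block" stands in for one MHA or MLP sub-layer wrapped in a residual connection, and hidden states are plain floats.

```python
from typing import Callable, List, Sequence, Set

# A block maps a hidden state to a residual update (hypothetical stand-in
# for one MHA or MLP sub-layer; real hidden states would be tensors).
Block = Callable[[float], float]


def forward(blocks: Sequence[Block], x: float, skip: Set[int]) -> float:
    """Run the residual stack, bypassing every block whose index is in `skip`."""
    for i, block in enumerate(blocks):
        if i not in skip:
            x = x + block(x)
    return x


def greedy_block_prune(
    blocks: Sequence[Block], calib: Sequence[float], n_remove: int
) -> List[int]:
    """Greedily pick `n_remove` blocks whose removal changes the output least.

    Assumption: redundancy is scored as mean squared error against the
    unpruned model on calibration inputs. This is an illustrative proxy,
    not a metric taken from the source.
    """
    if n_remove > len(blocks):
        raise ValueError("cannot remove more blocks than the model has")
    reference = [forward(blocks, x, set()) for x in calib]
    removed: Set[int] = set()
    for _ in range(n_remove):
        best_idx, best_err = -1, float("inf")
        for i in range(len(blocks)):
            if i in removed:
                continue
            trial = removed | {i}
            err = sum(
                (forward(blocks, x, trial) - ref) ** 2
                for x, ref in zip(calib, reference)
            ) / len(calib)
            if err < best_err:
                best_idx, best_err = i, err
        removed.add(best_idx)
    return sorted(removed)


# Toy stack: blocks 1 and 3 contribute almost nothing to the output.
toy_blocks: List[Block] = [
    lambda x: 0.5 * x,
    lambda x: 0.001 * x,
    lambda x: -0.3 * x,
    lambda x: 0.0,
]
pruned = greedy_block_prune(toy_blocks, calib=[1.0, -2.0, 0.5], n_remove=2)
print(pruned)  # [1, 3]
```

Scoring individual sub-layer blocks rather than whole transformer layers is what makes the search finer-grained: a layer whose MLP is redundant can still keep its attention block, and vice versa.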
Abstract
With the rapid growth in the size and complexity of large language models (LLMs), the costs associated with their training and inference have escalated significantly. Research indicates that certain layers in LLMs harbor substantial