BriefGPT.xyz
Jun, 2024
MoreauPruner: Robust Pruning of Large Language Models against Weight Perturbations
Zixiao Wang, Jingwei Zhang, Wenqian Zhao, Farzan Farnia, Bei Yu
TL;DR
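For large language models, we account for the effect of perturbations to model weights and propose MoreauPruner, a structural pruning method built on optimization analysis and the Moreau envelope. It prunes models stably and compares successfully with several other pruning methods.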
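As background on the term used above: the Moreau envelope of a function f with parameter lam > 0 is the standard smoothing M(x) = min over y of f(y) + (y - x)^2 / (2 * lam), and its gradient is (x - prox(x)) / lam, where prox(x) is the minimizing y. The sketch below evaluates both for the toy function f(x) = |x|, whose minimizer is the well-known soft-thresholding operator. It is an illustration of the general definition only, not the paper's pruning algorithm; the function names and the choice of f are assumptions made for this example.

```python
# Toy illustration of a Moreau envelope (not MoreauPruner itself).
# Assumption: f(x) = |x|, chosen because its proximal operator has a closed form.

def soft_threshold(x: float, lam: float) -> float:
    """Proximal operator of lam * |.|: the y minimizing |y| + (y - x)^2 / (2 * lam)."""
    if x > lam:
        return x - lam
    if x < -lam:
        return x + lam
    return 0.0


def moreau_envelope_abs(x: float, lam: float) -> float:
    """Value of the Moreau envelope of |.| at x, evaluated at the minimizer y."""
    y = soft_threshold(x, lam)
    return abs(y) + (y - x) ** 2 / (2 * lam)


def moreau_gradient_abs(x: float, lam: float) -> float:
    """Gradient of the envelope: (x - prox(x)) / lam. Defined everywhere, unlike |x|'s."""
    return (x - soft_threshold(x, lam)) / lam


if __name__ == "__main__":
    for x in (3.0, 0.5, 0.0, -2.0):
        print(x, moreau_envelope_abs(x, 1.0), moreau_gradient_abs(x, 1.0))
```

The envelope of |x| is smooth at 0, where |x| itself has no gradient, and its gradient stays bounded between -1 and 1. That smoothing property is the general reason a Moreau envelope yields gradients that change less abruptly under small input shifts.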
Abstract
Few-shot gradient methods have been extensively utilized in existing model pruning methods, where the model weights are regarded as static values and the effects of potential …