BriefGPT.xyz
Aug, 2024
基于奇异值的自适应低秩适应
SARA: Singular-Value Based Adaptive Low-Rank Adaption
HTML
PDF
Jihao Gu, Shuai Chen, Zelin Wang, Yibo Zhang, Ping Gong
TL;DR
本研究解决了在大规模预训练模型中,LoRA方法需要手动验证不同层级的秩值匹配不同下游任务的问题。通过奇异值分解分析层与秩之间的关系,提出了SARA方法,能够在初始化时自适应地找到适合的秩;另外,通过探索Mixture-of-SARA方法,大幅度减少参数数量,验证了方法的简单性和参数效率。
Abstract
With the increasing number of parameters in large pre-trained models, LoRA as a parameter-efficient
Fine-Tuning
(PEFT) method is widely used for not adding inference overhead. The LoRA method assumes that weight changes during
→