In this paper, we introduce Nested Low-Rank Adaptation (NoRA), a novel approach to parameter-efficient fine-tuning that extends the capabilities of Low-Rank Adaptation (LoRA) techniques. Vanilla LoRA overlooks pre-trained weight inheritance and still requires fine-tuning numerous parameters. To addresses these issues, our NoRA adopts a dual-layer nested structure with Singular Value Decomposition (SVD), effectively leveraging original matrix knowledge while reducing tunable parameters. Specifically, NoRA freezes the outer LoRA weights and utilizes an inner LoRA design, providing enhanced control over model optimization. This approach allows the model to more precisely adapt to specific tasks while maintaining a compact parameter space. By freezing outer LoRA weights and using an inner LoRA design, NoRA enables precise task adaptation with a compact parameter space. Evaluations on tasks including commonsense reasoning with large language models, fine-tuning vision-language models, and subject-driven generation demonstrate NoRA's superiority over LoRA and its variants. Notably, NoRA reduces fine-tuning parameters|training-time|memory-usage by 4\%|22.5\%|20.7\% compared to LoRA on LLaMA-3 8B, while achieving 2.2\% higher performance. Code will be released upon acceptance.

本文提出了一种新的参数高效微调方法——嵌套低秩适应（NoRA），旨在解决传统低秩适应（LoRA）在微调过程中参数数量过多和未充分利用预训练权重的问题。NoRA通过采用双层嵌套结构和奇异值分解（SVD），显著减少了可调参数数量，并在多项任务评估中表现出相较于LoRA及其变种更优的性能，降低了微调的参数、训练时间和内存使用，同时性能提升了2.2%。

高效微调大模型的嵌套低秩适应方法（NoRA）