BriefGPT.xyz
Nov, 2023
LM-Cocktail:通过模型合并实现语言模型的可靠调整
LM-Cocktail: Resilient Tuning of Language Models via Model Merging
HTML
PDF
Shitao Xiao, Zheng Liu, Peitian Zhang, Xingrun Xing
TL;DR
通过模型合并的方法(LM-Cocktail),将预训练语言模型与微调的模型通过加权平均的方式融合,以使得微调模型在一般任务中能够保持强大的实际性能,同时在特定领域具有优越的能力。
Abstract
The
pre-trained language models
are continually
fine-tuned
to better support downstream applications. However, this operation may result in significant performance degeneration on general tasks beyond the targete
→