BriefGPT.xyz
Aug, 2024
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities
Enneng Yang, Li Shen, Guibing Guo, Xingwei Wang, Xiaochun Cao...
TL;DR
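This study addresses the lack of a systematic survey of model merging methods in the current literature by proposing a new taxonomy for comprehensively discussing existing model merging techniques. It finds that model merging has broad application potential across many areas, including large language models and multimodal large language models, while also facing several challenges that call for future research.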
Abstract
Model merging is an efficient empowerment technique in the machine learning community that does not require the collection of raw training data and does not require expensive computation. As
→