BriefGPT.xyz
Oct, 2024
在深度模型合并技术中寻找损失景观的共同点
SoK: On Finding Common Ground in Loss Landscapes Using Deep Model Merging Techniques
HTML
PDF
Arham Khan, Todd Nief, Nathaniel Hudson, Mansi Sakarvadia, Daniel Grzenda...
TL;DR
本研究解决了当前神经网络可解释性研究在理解模型训练行为及其任务特定行为方面的显著不足。通过分析模型合并文献并结合损失景观几何学的视角,提供了一种新的分类方法和四个主要方面的特征化,这些发现为提高机器学习的安全性和可信赖性奠定了基础。
Abstract
Understanding
Neural networks
is crucial to creating reliable and trustworthy deep learning models. Most contemporary research in
Interpretability
analyzes just one model at a time via causal intervention or acti
→