BriefGPT.xyz
Oct, 2023
Equivariant Deep Weight Space Alignment
Aviv Navon, Aviv Shamsian, Ethan Fetaya, Gal Chechik, Nadav Dym...
TL;DR
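This work introduces Deep-Align, a new framework that learns to solve the weight alignment problem. The study identifies two fundamental symmetries of the problem, tied to the permutation symmetries of deep networks and to permutations of their weights. Experiments across several network architectures and learning settings show that Deep-Align produces alignments that are better than or on par with those of current optimization algorithms. It can also serve as an initialization for other methods, leading to better solutions and significantly faster convergence.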
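To make the permutation symmetry concrete, here is a minimal sketch in plain Python. It is not the Deep-Align method. It only illustrates, on a hypothetical two-layer tanh MLP, that reordering hidden units leaves the function unchanged, and that averaging two unaligned copies of the weights does not in general preserve it. All names (`mlp`, `permute_hidden`, `average`, the layer sizes, and the chosen `perm`) are assumptions made for illustration.

```python
import math
import random


def mlp(x, W1, b1, W2, b2):
    """Two-layer MLP on plain lists: y = W2 @ tanh(W1 @ x + b1) + b2."""
    h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
         for row, b in zip(W1, b1)]
    return [sum(w * hi for w, hi in zip(row, h)) + b
            for row, b in zip(W2, b2)]


def permute_hidden(W1, b1, W2, perm):
    """Reorder hidden units: rows of W1, entries of b1, matching columns of W2."""
    W1p = [W1[i] for i in perm]
    b1p = [b1[i] for i in perm]
    W2p = [[row[i] for i in perm] for row in W2]
    return W1p, b1p, W2p


def average(A, B):
    """Element-wise mean of two equally shaped matrices (lists of lists)."""
    return [[(a + b) / 2 for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]


# Hypothetical toy network with random weights (sizes chosen arbitrarily).
rng = random.Random(0)
d_in, d_hidden, d_out = 3, 4, 2
W1 = [[rng.gauss(0, 1) for _ in range(d_in)] for _ in range(d_hidden)]
b1 = [rng.gauss(0, 1) for _ in range(d_hidden)]
W2 = [[rng.gauss(0, 1) for _ in range(d_hidden)] for _ in range(d_out)]
b2 = [rng.gauss(0, 1) for _ in range(d_out)]
x = [rng.gauss(0, 1) for _ in range(d_in)]

# A permuted copy has different weights but computes the same function.
perm = [2, 0, 3, 1]
W1p, b1p, W2p = permute_hidden(W1, b1, W2, perm)
y = mlp(x, W1, b1, W2, b2)
y_perm = mlp(x, W1p, b1p, W2p, b2)
same_function = all(math.isclose(a, b, abs_tol=1e-12) for a, b in zip(y, y_perm))

# Naive averaging of the two unaligned weight sets.
y_avg = mlp(x, average(W1, W1p),
            [(a + b) / 2 for a, b in zip(b1, b1p)],
            average(W2, W2p), b2)

# Aligning first: apply the inverse permutation to the permuted copy.
inv = [0] * len(perm)
for j, k in enumerate(perm):
    inv[k] = j
W1a, b1a, W2a = permute_hidden(W1p, b1p, W2p, inv)

print("permuted network matches original:", same_function)
print("original output:", y)
print("naively averaged output:", y_avg)
print("aligned weights equal original:", (W1a, b1a, W2a) == (W1, b1, W2))
```

In this toy case the permutation is known, so undoing it is trivial. In practice it is unknown and has to be found, which is the alignment problem the summary refers to.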
Abstract
Permutation symmetries of deep networks make simple operations like model averaging and similarity estimation challenging. In many cases, aligning the weights of the networks, i.e., finding optimal permutations b