BriefGPT.xyz
Jun, 2023
规范层是Sharpness-Aware最小化的全部需求
Normalization Layers Are All That Sharpness-Aware Minimization Needs
HTML
PDF
Maximilian Mueller, Tiffany Vlaar, David Rolnick, Matthias Hein
TL;DR
本文研究了对小部分参数进行扰动的Sharpness-aware minimization (SAM)的性能,并通过实验结果证明,只操作正规化处理中几乎不占比例的仿射变换参数能比全局扰动得到更好的效果。
Abstract
sharpness-aware minimization
(SAM) was proposed to reduce sharpness of minima and has been shown to enhance
generalization performance
in various settings. In this work we show that perturbing only the
→