BriefGPT.xyz
Sep, 2024
双边锐度感知最小化以获取更平坦的极小值
Bilateral Sharpness-Aware Minimization for Flatter Minima
HTML
PDF
Jiaxin Deng, Junbiao Pang, Baochang Zhang, Qingming Huang
TL;DR
本研究针对锐度感知最小化(SAM)在提升泛化能力过程中存在的“平坦指示器问题”进行了探讨,提出了双边SAM(BSAM)方法。通过引入当前权重周围邻域内训练损失与最小损失之间的差异,BSAM能够指导优化过程朝向更平坦的最小值,实验证明其在多项任务中的泛化性能和鲁棒性优于传统SAM。
Abstract
Sharpness-Aware Minimization
(SAM) enhances
Generalization
by reducing a Max-Sharpness (MaxS). Despite the practical success, we empirically found that the MAxS behind SAM's
→