BriefGPT.xyz
Oct, 2020
针对高效改善泛化性能的锐度感知最小化
Sharpness-Aware Minimization for Efficiently Improving Generalization
HTML
PDF
Pierre Foret, Ariel Kleiner, Hossein Mobahi, Behnam Neyshabur
TL;DR
本文引入了一种新颖、有效的程序,即Sharpness-Aware Minimization(SAM),通过在局部参数空间中同时最小化损失值和损失锐度,以提高模型泛化能力。实验结果表明,SAM在多个数据集和模型上都取得了最新的最好结果,同时也提供了与最先进的噪声标记学习特定过程相当的抗噪性。
Abstract
In today's heavily
overparameterized models
, the value of the training loss provides few guarantees on model
generalization
ability. Indeed, optimizing only the training loss value, as is commonly done, can easil
→