BriefGPT.xyz
Feb, 2022
惩罚梯度范数以有效提高深度学习的推广性能
Penalizing Gradient Norm for Efficiently Improving Generalization in Deep Learning
HTML
PDF
Yang Zhao, Hao Zhang, Xiuyuan Hu
TL;DR
本文提出通过附加惩罚损失函数的梯度范数来提高深度神经网络的泛化性能,使用我们的方法可以改善不同数据集上的各种模型的泛化性能,并且最佳情况下可在这些任务上提供新的最先进性能。
Abstract
How to train
deep neural networks
(DNNs) to generalize well is a central concern in deep learning, especially for severely overparameterized networks nowadays. In this paper, we propose an effective method to improve the model
→