BriefGPT.xyz
Sep, 2024
Optimization and Generalization Guarantees for Weight Normalization
Pedro Cisneros-Velarde, Zhijie Chen, Sanmi Koyejo, Arindam Banerjee
TL;DR
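This work addresses the theoretical gap in the optimization and generalization of deep weight-normalized models. It provides the first theoretical characterization of both optimization and generalization for WeightNorm models, in particular establishing convergence guarantees and uniform convergence bounds under smooth activation functions. Experimental results show that the normalization terms are closely tied to how deep neural networks train, which suggests significant practical potential.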
Abstract
Weight Normalization (WeightNorm) is widely used in practice for the training of deep neural networks, and modern deep learning libraries have built-in implementations of it. In this paper, we provide the first th
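For readers unfamiliar with the technique, weight normalization is commonly described as reparameterizing a weight vector as w = g · v / ‖v‖, so that the length of w is controlled by the scalar g and its direction by v. The following is a minimal sketch of that standard reparameterization in plain Python; it assumes this common form, and the function names `weight_norm` and `l2_norm` are illustrative and not taken from the paper or from any library.

```python
import math


def l2_norm(vec):
    """Euclidean norm of a list of floats."""
    return math.sqrt(sum(x * x for x in vec))


def weight_norm(v, g):
    """Return w = g * v / ||v||_2 for a direction vector v and scalar gain g.

    Illustrative helper (hypothetical name): assumes the standard
    weight-normalization reparameterization for a single weight vector.
    """
    norm = l2_norm(v)
    if norm == 0.0:
        raise ValueError("direction vector v must be nonzero")
    return [g * x / norm for x in v]


# Example: the resulting weight vector has length |g| regardless of ||v||.
v = [3.0, 4.0]
g = 2.0
w = weight_norm(v, g)
```

Rescaling `v` leaves `w` unchanged here, which is the property that decouples the weight's magnitude from its direction.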