BriefGPT.xyz
Apr, 2023
深度神经网络中权重矩阵的重尾正则化
Heavy-Tailed Regularization of Weight Matrices in Deep Neural Networks
HTML
PDF
Xuanzhe Xiao, Zeng Li, Chuanlong Xie, Fengwei Zhou
TL;DR
通过随机矩阵理论,提出了一种名为“Heavy-Tailed Regularization”的正则化技术,此技术优化了神经网络的权重矩阵,使其有更重的尾巴,并提升了网络的泛化能力。对比传统的正则化方法,实验结果证明这种新方法在泛化效果上更优秀。
Abstract
Unraveling the reasons behind the remarkable success and exceptional generalization capabilities of
deep neural networks
presents a formidable challenge. Recent insights from
random matrix theory
, specifically th
→