BriefGPT.xyz
May, 2019
一种简单有效的正则化方法,用于携带有泛化保证的嘈杂标签数据的训练
Understanding Generalization of Deep Neural Networks Trained with Noisy Labels
HTML
PDF
Wei Hu, Zhiyuan Li, Dingli Yu
TL;DR
探讨在有噪声标签的情况下,过度参数化的深度神经网络的正则化方法,其中比较有效的包括参数与初始化之间的距离和为每个训练示例添加一个可训练的辅助变量,实验结果表明这些方法能够有效提高模型的泛化性,并且泛化误差的上界独立于网络的大小,可达到无噪声标签情况下的水平。
Abstract
over-parameterized deep neural networks
trained by simple first-order methods are known to be able to fit any labeling of data. When the training dataset contains a fraction of
noisy labels
, can neural networks b
→