BriefGPT.xyz
Feb, 2020
通过Wasserstein度量匹配分布来规则神经网络的激活函数
Regularizing activations in neural networks via distribution matching with the Wasserstein metric
HTML
PDF
Taejong Joo, Donggu Kang, Byunghoon Kim
TL;DR
该论文介绍了一种新的正则化方法(PER), 通过将激活在概率分布空间中与标准正态分布进行匹配,从而达到正则化的目的。该方法可以用于图像分类任务和语言建模任务。
Abstract
regularization
and
normalization
have become indispensable components in training deep
neural networks
, resulting in faster training and i
→