BriefGPT.xyz
May, 2017
Kronecker 循环单元
Kronecker Recurrent Units
HTML
PDF
Cijo Jose, Moustpaha Cisse, Francois Fleuret
TL;DR
我们提出了称为Kronecker循环单元(KRU)的灵活的递归神经网络模型,它通过Kronecker分解循环矩阵实现RNNs的参数效率,并通过对因子施加软幺正约束克服循环矩阵的病态条件,从而克服递归神经网络过度参数化的问题。 KRU模型可以降低递归权重矩阵中的参数数量三个数量级,而不会牺牲统计性能。
Abstract
Our work addresses two important issues with
recurrent neural networks
: (1) they are over-parameterized, and (2) the recurrence matrix is ill-conditioned. The former increases the
sample complexity
of learning an
→