BriefGPT.xyz
Oct, 2019
改进循环神经网络的门控机制
Improving the Gating Mechanism of Recurrent Neural Networks
HTML
PDF
Albert Gu, Caglar Gulcehre, Tom Le Paine, Matt Hoffman, Razvan Pascanu
TL;DR
通过引入两个改进标准门控机制的修改,解决了门控机制在饱和状态下学习梯度的问题,在模拟记忆任务、序列图像分类、语言建模和强化学习等应用中有效提高了循环模型的性能。
Abstract
gating mechanisms
are widely used in
neural network models
, where they allow gradients to backpropagate more easily through depth or time. However, their saturation property introduces problems of its own. For ex
→