BriefGPT.xyz
Nov, 2023
层次门控循环神经网络用于序列建模
Hierarchically Gated Recurrent Neural Network for Sequence Modeling
HTML
PDF
Zhen Qin, Songlin Yang, Yiran Zhong
TL;DR
提出了一种具有遗忘门的分层门控递归神经网络(HGRN)模型,其中遗忘门受可学习值下界限制,使得上层能够建模长期依赖,而下层能够建模更局部、短期的依赖关系。通过在语言建模、图像分类和长距离竞技场测试中进行实验,证明了该模型的高效性和有效性。
Abstract
transformers
have surpassed RNNs in popularity due to their superior abilities in parallel training and long-term dependency modeling. Recently, there has been a renewed interest in using
linear rnns
for efficien
→