BriefGPT.xyz
Feb, 2024
关于循环模型在长序列中的复兴:变形器时代的调研和研究机会
On the Resurgence of Recurrent Models for Long Sequences: Survey and Research Opportunities in the Transformer Era
HTML
PDF
Matteo Tiezzi, Michele Casoni, Alessandro Betti, Tommaso Guidi, Marco Gori...
TL;DR
深度学习中基于Transformer和循环神经网络的顺序处理对于处理长序列数据和无限长度序列数据具有重要意义。
Abstract
A longstanding challenge for the
machine learning
community is the one of developing models that are capable of processing and learning from very long sequences of data. The outstanding results of
transformers-based net
→