BriefGPT.xyz
Jun, 2024
长序列处理中的状态空间建模:对Transformer时代中的循环的调查
State-Space Modeling in Long Sequence Processing: A Survey on Recurrence in the Transformer Era
HTML
PDF
Matteo Tiezzi, Michele Casoni, Alessandro Betti, Marco Gori, Stefano Melacci
TL;DR
对基于循环模型的顺序数据处理的最新方法进行了深入总结,并提供了关于体系结构和算法解决方案的完整分类,引导研究者在这一吸引人的研究领域进行进一步研究。
Abstract
Effectively learning from
sequential data
is a longstanding goal of Artificial Intelligence, especially in the case of
long sequences
. From the dawn of Machine Learning, several researchers engaged in the search
→