BriefGPT.xyz
May, 2024
xLSTM:扩展的长短期记忆
xLSTM: Extended Long Short-Term Memory
HTML
PDF
Maximilian Beck, Korbinian Pöppel, Markus Spanring, Andreas Auer, Oleksandra Prudnikova...
TL;DR
我们修改并扩展LSTM的门控机制和记忆结构,得到了xLSTM模型,该模型在性能和规模上与最先进的Transformer模型和状态空间模型相比表现出色。
Abstract
In the 1990s, the constant error carousel and gating were introduced as the central ideas of the
long short-term memory
(
lstm
). Since then, LSTMs have stood the test of time and contributed to numerous deep learn
→