BriefGPT.xyz
May, 2024
状态空间模型的表达能力:形式语言视角
The Expressive Capacity of State Space Models: A Formal Language Perspective
HTML
PDF
Yash Sarrof, Yana Veitsman, Michael Hahn
TL;DR
基于线性状态空间模型的循环模型在语言建模方面表现出色,与变压器竞争力强,但对此类模型的原理能力了解甚少,因此我们提出了一项理论研究,比较了这种模型与变压器和传统循环神经网络的能力,发现它们有重叠但有区别的优势。
Abstract
Recently,
recurrent models
based on
linear state space models
(SSMs) have shown promising performance in
language modeling
(LM), competiti
→