Apr, 2024

复仇之后?循环模型与变形金刚在预测人类语言理解度量方面相匹敌

TL;DRRNN architectures RWKV and Mamba perform natural language tasks comparably to or better than transformers, challenging the notion that transformers are uniquely suited for modeling online human language comprehension.