BriefGPT.xyz
Mar, 2024
双向对称长距离DNA序列建模
Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling
HTML
PDF
Yair Schiff, Chia-Hsiang Kao, Aaron Gokaslan, Tri Dao, Albert Gu...
TL;DR
提出了一种基于长程Mamba块构建的双向长程DNA语言模型,命名为Caduceus,它在下游基准测试中表现优于之前的长程模型,并在具有挑战性的长程变体效应预测任务中超越不利用双向性或等变性的10倍更大的模型。
Abstract
large-scale sequence modeling
has sparked rapid advances that now extend into biology and genomics. However, modeling
genomic sequences
introduces challenges such as the need to model long-range token interaction
→