BriefGPT.xyz
Feb, 2019
星形转换器
Star-Transformer
HTML
PDF
Qipeng Guo, Xipeng Qiu, Pengfei Liu, Yunfan Shao, Xiangyang Xue...
TL;DR
本文介绍了Star-Transformer,一种轻量级的NLP模型,通过精细的稀疏化将全连接注意力连接结构替换为星形拓扑结构,将复杂性从二次降为线性,同时保持了捕获局部组合和长距离依赖性的能力,并在四个任务的22个数据集上取得了显著的性能提升。
Abstract
Although the fully-connected attention-based model
transformer
has achieved great successes on many
nlp tasks
, it has heavy structure and usually requires large training data. In this paper, we present the Star-<
→