BriefGPT.xyz
Jan, 2021
注意力可以反映句法结构(如果你允许)
Attention Can Reflect Syntactic Structure (If You Let It)
HTML
PDF
Vinit Ravishankar, Artur Kulmizev, Mostafa Abdou, Anders Søgaard, Joakim Nivre
TL;DR
本研究通过对18种语言进行多语言BERT 的解码实验,以测试依存句法是否反映在注意力模式中的普适性,并归纳出单一注意力头可以以上线准确率解码全树。尝试通过对mBERT 进行监督解析目标的微调,结果表明注意力模式可以代表语言结构。
Abstract
Since the popularization of the
transformer
as a general-purpose feature encoder for
nlp
, many studies have attempted to decode linguistic structure from its novel multi-head attention mechanism. However, much of
→