BriefGPT.xyz
Oct, 2020
Intrinsic Probing through Dimension Selection
Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell
TL;DR
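This paper discusses the shortcomings of earlier methods for probing linguistic structure in natural language processing systems and proposes an intrinsic probing framework, based on a multivariate Gaussian probe, for detecting the linguistic information held in word embeddings. Experiments on 36 languages show that most morphosyntactic features are reliably encoded by a small number of neurons, and that fastText concentrates its linguistic structure in fewer dimensions than BERT does.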
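To make the idea of a Gaussian probe restricted to a few dimensions concrete, here is a minimal sketch in Python with NumPy. It is not the authors' implementation: the function names, the ridge term added to the covariance, the greedy log-likelihood selection criterion, and the synthetic data are all assumptions made for illustration. The one property it relies on is standard: the marginal of a multivariate Gaussian over a subset of dimensions is obtained by sub-selecting the mean and covariance, so the probe is fitted once and can then be evaluated on any subset of dimensions without retraining.

```python
import numpy as np


def make_toy_data(n=400, dim=8, informative=3, seed=0):
    """Synthetic 'embeddings' (assumption): only one dimension carries the label."""
    rng = np.random.default_rng(seed)
    y = rng.integers(0, 2, size=n)
    X = rng.normal(size=(n, dim))
    X[:, informative] += 3.0 * y
    return X, y


def fit_gaussians(X, y):
    """Fit one full-covariance Gaussian per class: {label: (prior, mean, cov)}."""
    params = {}
    for label in np.unique(y):
        Xc = X[y == label]
        # Small ridge term (assumption) keeps the covariance invertible.
        cov = np.cov(Xc, rowvar=False) + 1e-3 * np.eye(X.shape[1])
        params[label] = (len(Xc) / len(X), Xc.mean(axis=0), cov)
    return params


def log_joint(X, params, dims):
    """log p(x[dims], class) per row and class, using only the chosen dims."""
    dims = np.asarray(dims)
    cols = []
    for prior, mean, cov in params.values():
        diff = X[:, dims] - mean[dims]
        sub = cov[np.ix_(dims, dims)]  # Gaussian marginal: sub-select mean and cov
        _, logdet = np.linalg.slogdet(sub)
        maha = np.einsum("ij,ij->i", diff @ np.linalg.inv(sub), diff)
        const = len(dims) * np.log(2 * np.pi)
        cols.append(np.log(prior) - 0.5 * (maha + logdet + const))
    return np.stack(cols, axis=1)


def label_log_likelihood(X, y, params, dims):
    """Mean log p(label | x[dims]) of the true labels under the probe."""
    lj = log_joint(X, params, dims)
    labels = np.array(list(params.keys()))  # sorted, since built from np.unique
    idx = np.searchsorted(labels, y)
    m = lj.max(axis=1, keepdims=True)
    log_norm = m[:, 0] + np.log(np.exp(lj - m).sum(axis=1))
    return float(np.mean(lj[np.arange(len(y)), idx] - log_norm))


def accuracy(X, y, params, dims):
    """Fraction of rows whose most probable class matches the true label."""
    labels = np.array(list(params.keys()))
    pred = labels[np.argmax(log_joint(X, params, dims), axis=1)]
    return float(np.mean(pred == y))


def greedy_select(X, y, params, k):
    """Greedily add the dimension that most raises the label log-likelihood."""
    chosen = []
    for _ in range(k):
        rest = [d for d in range(X.shape[1]) if d not in chosen]
        best = max(rest, key=lambda d: label_log_likelihood(X, y, params, chosen + [d]))
        chosen.append(best)
    return chosen
```

A possible usage, under the same assumptions: call `X, y = make_toy_data()`, then `params = fit_gaussians(X, y)`, then `greedy_select(X, y, params, k=2)`. On the toy data the first selected dimension should be the single informative one, which mirrors, in miniature, the claim that a feature can be read off a small number of neurons.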
Abstract
Most modern NLP systems make use of pre-trained contextual representations that attain astonishingly high performance on a variety of tasks. Such high performance should not be possible unless some form of …