BriefGPT.xyz
Apr, 2020
信息论探针用于语言结构探索
Information-Theoretic Probing for Linguistic Structure
HTML
PDF
Tiago Pimentel, Josef Valvoda, Rowan Hall Maudslay, Ran Zmigrod, Adina Williams...
TL;DR
本文介绍了一种基于信息理论的方法来评估神经网络对自然语言处理的理解程度,即探针,发现在评估中应选择表现最好的模型,即使它是更复杂的模型,以获得更紧密的估计和更多的语言信息。作者在多种语言数据集上进行实验验证了这种方法的有效性。
Abstract
The success of
neural networks
on a diverse set of
nlp tasks
has led researchers to question how much do these networks actually know about natural language. Probes are a natural way of assessing this. When
→