The Low-Dimensional Linear Geometry of Contextualized Word Representations

May 2021

Evan Hernandez, Jacob Andreas

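TL;DR: This paper studies the linear geometry of word representations in ELMo and BERT. It finds that low-dimensional subspaces encode a range of linguistic features, including structured dependency relations; that these subspaces are hierarchically related to one another; and that they can be used for fine-grained manipulation of BERT's output distribution.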
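To make the idea of a feature living in a low-dimensional linear subspace concrete, here is a minimal toy sketch. It is not the authors' method or code. It assumes synthetic vectors in place of real ELMo or BERT representations, a single binary feature (think singular vs. plural), and an ordinary least-squares probe in place of the paper's probes. All names and sizes (`reps`, `direction`, `n`, `d`) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 2000, 64  # hypothetical: number of word vectors, representation size

# Synthetic stand-in for contextual word vectors (an assumption, not real
# BERT output): a binary feature shifts each vector along one hidden direction.
labels = rng.integers(0, 2, size=n)
direction = rng.normal(size=d)
direction /= np.linalg.norm(direction)
signs = 2.0 * labels - 1.0  # map {0, 1} -> {-1, +1}
reps = rng.normal(size=(n, d)) + 4.0 * signs[:, None] * direction

# Fit a least-squares linear probe on the centered representations.
mean = reps.mean(axis=0)
w, *_ = np.linalg.lstsq(reps - mean, signs, rcond=None)

# The probe weights span a 1-dimensional subspace; project onto it.
u = w / np.linalg.norm(w)
coords = (reps - mean) @ u  # one scalar coordinate per word vector

# Classify using only that single coordinate.
pred = (coords > 0).astype(int)
acc = (pred == labels).mean()
alignment = abs(u @ direction)  # cosine between learned and true direction

print(f"accuracy from 1-D projection: {acc:.3f}")
print(f"alignment with planted direction: {alignment:.3f}")
```

In this toy setting, one coordinate out of 64 is enough to recover the feature, because the feature was planted along a single direction. Whether real representations behave this way is the empirical question the paper investigates.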

Abstract

Black-box probing models can reliably extract linguistic features like tense, number, and syntactic role from pretrained word representations. However, the manner in which these features are encoded in representations remains poorly understood. We present a systematic study of the