可解释性超密集词向量的分析方法

Apr, 2019

可解释性超密集词向量的分析方法

Analytical Methods for Interpretable Ultradense Word Embeddings

Philipp Dufter, Hinrich Schütze

TL;DR研究word embeddings的可解释性，通过旋转word spaces进行interpretable dimensions的识别并保留原有信息，提出了DensRay方法进行closed form计算，相比于Densifier更加鲁棒，对lexicon induction和word analogy进行了实验，并展示了可解释性word spaces如何应用于去除嵌入中的性别偏见。

Abstract

word embeddings are useful for a wide variety of tasks, but they lack interpretability. By rotating word spaces, interpretable dimensions can be identified while preserving the information contained in the embedd