重温上下文窗口：用于跨语言词嵌入的方法

Apr, 2020

重温上下文窗口：用于跨语言词嵌入的方法

Revisiting the Context Window for Cross-lingual Word Embeddings

Ryokan Ri, Yoshimasa Tsuruoka

TL;DR本研究系统评估了使用不同上下文窗口大小训练的跨语言词嵌入在多种语言、领域和任务中的性能，并发现增加源和目标词窗口大小可以提高双语词汇归纳的性能，尤其是对于频繁的名词。

Abstract

Existing approaches to mapping-based cross-lingual word embeddings are based on the assumption that the source and target embedding spaces are structurally similar. The structures of embedding spaces largely depend on the co-occurrence statistics of each word, which the choice of