BriefGPT.xyz
Dec, 2020
超越离线映射:通过上下文锚定学习跨语言词向量
Beyond Offline Mapping: Learning Cross Lingual Word Embeddings through Context Anchoring
HTML
PDF
Aitor Ormazabal, Mikel Artetxe, Aitor Soroa, Gorka Labaka, Eneko Agirre
TL;DR
本研究提出了一种基于弱监督(仅有相同单词列表)的方法,通过固定目标语言的嵌入并学习与之对齐的源语言的嵌入来解决不同语言的单词嵌入相似性不一致的问题,并在双语词表归纳和XNLI任务上取得了较好的结果,相比于传统的映射方法表现更好。
Abstract
Recent research on
cross-lingual word embeddings
has been dominated by
unsupervised mapping
approaches that align monolingual embeddings. Such methods critically rely on those embeddings having a similar structur
→