We introduce categorical modularity, a novel low-resource intrinsic metric to evaluate word embedding quality. Categorical modularity is a graph modularity metric based on the $k$-nearest neighbor graph constructed from the embedding vectors of words drawn from a fixed set of semantic categories; it measures the extent to which words have nearest neighbors within their own category. We use a core set of 500 words belonging to 59 neurobiologically motivated semantic categories in 29 languages and analyze three word embedding models per language (FastText, MUSE, and subs2vec). We find moderate to strong positive correlations between categorical modularity and performance on the monolingual tasks of sentiment analysis and word similarity calculation and on the cross-lingual task of bilingual lexicon induction both to and from English. Overall, we suggest that categorical modularity provides non-trivial predictive information about downstream task performance, and the breakdown of correlations by model suggests some meta-predictive properties about semantic information loss as well.
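To make the idea concrete, the following is a minimal sketch of a modularity score computed on a $k$-nearest neighbor graph with category labels as the partition. The abstract does not give the exact formula, so this sketch assumes standard Newman modularity on a symmetrized, unweighted cosine-similarity graph; the function names `knn_adjacency` and `modularity` are hypothetical and the toy vectors are made up for illustration.

```python
import numpy as np


def knn_adjacency(vectors, k):
    """Symmetric, unweighted k-NN adjacency matrix under cosine similarity.

    Assumption: edges are treated as undirected; the original work may
    construct the graph differently.
    """
    vectors = np.asarray(vectors, dtype=float)
    unit = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
    sim = unit @ unit.T
    np.fill_diagonal(sim, -np.inf)  # a word is never its own neighbor
    neighbors = np.argsort(-sim, axis=1)[:, :k]  # k most similar words per row
    n = len(vectors)
    adj = np.zeros((n, n))
    adj[np.repeat(np.arange(n), k), neighbors.ravel()] = 1.0
    return np.maximum(adj, adj.T)  # keep an edge if either endpoint chose it


def modularity(adj, labels):
    """Newman modularity of the partition induced by `labels`.

    Assumption: this is the textbook definition
    Q = (1 / 2m) * sum_ij (A_ij - d_i * d_j / 2m) * [c_i == c_j].
    """
    labels = np.asarray(labels)
    degrees = adj.sum(axis=1)
    two_m = degrees.sum()
    same_category = labels[:, None] == labels[None, :]
    expected = np.outer(degrees, degrees) / two_m
    return float(((adj - expected) * same_category).sum() / two_m)


# Toy example: six 2-d "embeddings" forming two tight semantic categories.
toy_vectors = [[1.0, 0.0], [0.9, 0.1], [1.0, 0.1],
               [0.0, 1.0], [0.1, 0.9], [0.1, 1.0]]
toy_labels = [0, 0, 0, 1, 1, 1]
toy_score = modularity(knn_adjacency(toy_vectors, k=2), toy_labels)
```

A higher score means that words cluster with members of their own category more than a degree-matched random graph would predict; assigning the same words to categories that cut across the clusters lowers the score.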

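This paper introduces categorical modularity, a new low-resource intrinsic metric for evaluating the quality of word embedding models. Using a core set of 500 words from 59 neurobiologically motivated semantic categories, the authors analyze three word embedding models in each of 29 languages and report moderate to strong positive correlations between categorical modularity and performance on monolingual and cross-lingual tasks.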

Evaluating Word Embeddings with Categorical Modularity