BriefGPT.xyz
Jun, 2019
分层最优输运用于文档表示
Hierarchical Optimal Transport for Document Representation
HTML
PDF
Mikhail Yurochkin, Sebastian Claici, Edward Chien, Farzaneh Mirzazadeh, Justin Solomon
TL;DR
该论文介绍了基于分布和话题建模的层次最优输运方法作为文档之间的元距离,以量化文档之间的相似性。这种方法具有解释性和可扩展性,并在 k-NN 分类方面表现良好。
Abstract
The ability to measure
similarity
between
documents
enables intelligent
summarization
and analysis of large corpora. Past distances betwee
→