BriefGPT.xyz
Feb, 2022
UCTopic:无监督对比学习短语表征与主题挖掘
UCTopic: Unsupervised Contrastive Learning for Phrase Representations and Topic Mining
HTML
PDF
Jiacheng Li, Jingbo Shang, Julian McAuley
TL;DR
本文提出了UCTopic,一种使用无监督对比学习框架来学习上下文感知短语表示和进行主题挖掘,通过正对构建来预训练UCTopic,使用聚类辅助对比学习(CCL)显着降低噪声负例,进一步提高了短语表示的类别学习效果,在实体聚类任务上优于现有的同类模型。
Abstract
High-quality
phrase representations
are essential to finding topics and related terms in documents (a.k.a.
topic mining
). Existing phrase representation learning methods either simply combine unigram representati
→