implicit feedback data is extensively explored in recommendation as it is
easy to collect and generally applicable. However, predicting users' preference
on implicit feedback data is a challenging task since we c
本文提出了一种基于概率差距的 PU 学习算法,该算法通过利用条件概率 P (Y=1|X) 对正样例进行有偏重采样,并将未标记数据视为噪声负样例,从而自动标记一组正负样例,这些样例的标签与贝叶斯最优分类器分配的标签相同。通过核均值匹配技术纠正其偏差。实验结果表明,该方法在生成的和现实世界的数据集上均表现良好。