BriefGPT.xyz
Nov, 2023
Robust and Automatic Data Clustering: Dirichlet Process meets Median-of-Means
Supratik Basu, Jyotishka Ray Choudhury, Debolina Paul, Swagatam Das
TL;DR
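By integrating model-based and centroid-based methods, the authors propose an efficient, automatic clustering technique that mitigates the effect of noise on clustering quality while retaining the advantage of not requiring the number of clusters to be specified in advance. Rigorous evaluation on simulated and real datasets, together with statistical guarantees, shows that the proposed method outperforms existing state-of-the-art clustering algorithms.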
Abstract
Clustering stands as one of the most prominent challenges within the realm of unsupervised machine learning. Among the array of centroid-based clustering algorithms, the classic $k$-means algorithm, rooted in Llo…