TL;DR: This work proposes a “Powered Chinese Restaurant Process” to prevent over-clustering and to reduce the storage and communication costs of data.
Abstract
Dirichlet process mixture (DPM) models tend to produce many small clusters
regardless of whether they are needed to accurately characterize the data;
this is particularly true for large data sets. However, interpretabi