BriefGPT.xyz
Dec, 2017
信息瓶颈和几何聚类
The information bottleneck and geometric clustering
HTML
PDF
D J Strouse, David J Schwab
TL;DR
本文基于确定性信息瓶颈(DIB)方法提出了一种新的模型选择算法,将改进后的IB方法用于基于几何距离的聚类,通过识别聚类数与空间信息之间的权衡,有效选择聚类数,实现了对k-means和EM算法的信息论泛化。
Abstract
The
information bottleneck
(IB) approach to
clustering
takes a joint distribution $P\!\left(X,Y\right)$ and maps the data $X$ to cluster labels $T$ which retain maximal information about $Y$ (Tishby et al., 1999)
→