BriefGPT.xyz
Jun, 2021
基于学习的亚线性时间支持估计
Learning-based Support Estimation in Sublinear Time
HTML
PDF
Talya Eden, Piotr Indyk, Shyam Narayanan, Ronitt Rubinfeld, Sandeep Silwal...
TL;DR
本文提出了一种基于机器学习的辅助估计算法来解决大型数据集中不同元素数量的估计问题,并证明了当预测器正确的逼近因子为常数时,可以显著降低样本复杂度。
Abstract
We consider the problem of estimating the number of
distinct elements
in a large data set (or, equivalently, the
support size
of the distribution induced by the data set) from a random sample of its elements. The
→