BriefGPT.xyz
Dec, 2015
Active Sampler: 面向规模化复杂数据分析的轻量级加速器
Active Sampler: Light-weight Accelerator for Complex Data Analytics at Scale
HTML
PDF
Jinyang Gao, H. V. Jagadish, Beng Chin Ooi
TL;DR
通过研究数据访问模式如何影响模型训练,提出了Active Sampler算法,它可以让训练数据更加集中在有价值的实例附近,实验证明其能够在SVM,特征选择和深度学习中提高训练速度1.6-2.2倍。
Abstract
Recent years have witnessed amazing outcomes from "
big models
" trained by "
big data
". Most popular algorithms for model training are iterative. Due to the surging volumes of data, we can usually afford to process
→