Urvashi Oswal, Swayambhoo Jain, Kevin S. Xu, Brian Eriksson
TL;DR本文研究了采样预定义块以逼近矩阵的问题,应用了一种适用于大型矩阵分布式设置中计算块 CUR 分解的算法,并应用于生物识别数据分析,在真实世界的测试环境中展示了实验结果。
Abstract
A common problem in large-scale data analysis is to approximate a matrix
using a combination of specifically sampled rows and columns, known as CUR
decomposition. Unfortunately, in many real-world environments, the ability to
sample specific individual rows or columns of the matrix is limited by either
system constraints or cost. In this paper, we consider <