We call a matrix algorithm superfast (aka running at sublinear cost) if it involves much fewer flops and memory cells than the matrix has entries. Using such algorithms is highly desired or even imperative in computations for Big Data, which involve immense matrices and are quite typically reduced to solving linear least squares problem and/or computation of low rank approximation of an input matrix. The known algorithms for these problems are not superfast, but we prove that their certain superfast modifications output reasonable or even nearly optimal solutions for large input classes. We also propose, analyze, and test a novel superfast algorithm for iterative refinement of any crude but sufficiently close low rank approximation of a matrix. The results of our numerical tests are in good accordance with our formal study.
翻译:我们称之为矩阵算法超级快(卡卡以亚线性成本运行),如果它涉及比矩阵条目少得多的浮点数和内存细胞。在计算大数据时,使用这种算法是非常可取的,甚至是必要的,它涉及巨大的矩阵,通常会降低到解决输入矩阵的线性最小方位问题和/或低级近似值的计算。这些问题的已知算法不是超快的,但我们证明,它们的某些超快修改产出对大型输入类来说是合理的,甚至几乎是最佳的解决方案。我们还提议、分析和测试一种新的超快算法,以迭接完善任何粗略但足够接近低级的矩阵。我们的数字测试结果与我们的正式研究完全一致。