By exploiting the random sampling techniques, this paper derives an efficient randomized algorithm for computing a generalized CUR decomposition, which provides low-rank approximations of both matrices simultaneously in terms of some of their rows and columns. For large-scale data sets that are expensive to store and manipulate, a new variant of the discrete empirical interpolation method known as L-DEIM, which needs much lower cost and provides a significant acceleration in practice, is also combined with the random sampling approach to further improve the efficiency of our algorithm. Moreover, adopting the randomized algorithm to implement the truncation process of restricted singular value decomposition (RSVD), combined with the L-DEIM procedure, we propose a fast algorithm for computing an RSVD based CUR decomposition, which provides a coordinated low-rank approximation of the three matrices in a CUR-type format simultaneously and provides advantages over the standard CUR approximation for some applications. We establish detailed probabilistic error analysis for the algorithms and provide numerical results that show the promise of our approaches.
翻译:----
通过利用随机采样技术,本文提出了一种有效的随机算法,用于计算广义CUR分解,该分解同时以矩阵的某些行列为代表,提供了低秩近似。针对昂贵存储且操作复杂的大规模数据集,本文还将一种新型离散经验插值方法L-DEIM与随机采样方法相结合,提高了算法的效率。此外,通过采用随机算法实现受限奇异值分解(RSVD)的截断过程,结合L-DEIM过程,我们提出了一种快速算法,用于计算基于RSVD的CUR分解,该分解以CUR类型格式协同近似三个矩阵,使其在某些应用中优于标准CUR近似。我们对算法进行了详细的概率误差分析,并提供了数值结果,证明了我们的方法的可行性。