This article introduces a novel structured random matrix composed blockwise from subsampled randomized Hadamard transforms (SRHTs). The block SRHT is expected to outperform well-known dimension reduction maps, including SRHT and Gaussian matrices, on distributed architectures with not too many cores compared to the dimension. We prove that a block SRHT with enough rows is an oblivious subspace embedding, i.e., an approximate isometry for an arbitrary low-dimensional subspace with high probability. Our estimate of the required number of rows is similar to that of the standard SRHT. This suggests that the two transforms should provide the same accuracy of approximation in the algorithms. The block SRHT can be readily incorporated into randomized methods, for instance to compute a low-rank approximation of a large-scale matrix. For completeness, we revisit some common randomized approaches for this problem such as Randomized Singular Value Decomposition and Nystr\"{o}m approximation, with a discussion of their accuracy and implementation on distributed architectures.
翻译:文章引入了由亚抽样的哈达马德随机变异( SRHTs) 构成的新型结构随机矩阵。 区块 SRHT 预计将在分布式结构上超过众所周知的尺寸缩小图, 包括SRHT 和 Gaussian 矩阵, 其核心与尺寸相比并不太多。 我们证明, 带足够行的块 SRHT 是隐蔽的子空间嵌入, 即任意低维次空间的概率高的近似等离子体。 我们对所需行数的估计与标准SRHT相似。 这表明, 两次变异应该提供算法中相似的近似精确度。 区块 SRHT 可以很容易地纳入随机化的方法, 例如, 将一个大尺度矩阵的低级别近似值编成。 为了完整起见, 我们重新审视了这个问题的一些常见随机化方法, 如随机化的 Singal 值脱形和 Nystr\\\\\ {o}m 近似, 讨论其准确性和在分布式结构上的执行情况 。