低调压缩高维模拟模拟数据的确定性矩阵草图草图 (Deterministic matrix sketches for low-rank compression of high-dimensional simulation data)

Matrices arising in scientific applications frequently admit linear low-rank approximations due to smoothness in the physical and/or temporal domain of the problem. In large-scale problems, computing an optimal low-rank approximation can be prohibitively expensive. Matrix sketching addresses this by reducing the input matrix to a smaller, but representative matrix via a low-dimensional linear embedding. If the sketch matrix produced by the embedding captures sufficient geometric properties of the original matrix, then a near-optimal approximation may be obtained. Much of the work done in matrix sketching has centered on random projection. Alternatively, in this work, deterministic matrix sketches which generate coarse representations, compatible with the corresponding PDE solve, are considered in the computation of the singular value decomposition and matrix interpolative decomposition. The deterministic sketching approaches in this work have many advantages over randomized sketches. Broadly, randomized sketches are data-agnostic, whereas the proposed sketching methods exploit structures within data generated in complex PDE systems. These deterministic sketches are often faster, require access to a small fraction of the input matrix, and do not need to be explicitly constructed. A novel single-pass, i.e., requiring one read over the input, power iteration algorithm is also presented. The power iteration method is particularly effective in improving low-rank approximations when the singular value decay of data is slow. Finally, theoretical error bounds and estimates, as well as numerical results across three application problems, are provided.

翻译：科学应用中产生的矩阵往往会接受由于问题物理和/或时间领域的平稳而导致的线性低位近似值。在大规模的问题中,计算一个最佳低位近似值可能极其昂贵。矩阵草图通过低维线嵌入将输入矩阵降低到一个较小但有代表性的矩阵。如果嵌入中生成的草图矩阵能够捕捉原始矩阵的足够几何特性,那么就可以获得近于最佳的近似近似值。矩阵草图中完成的许多工作都集中在随机预测上。或者,在这项工作中,在计算单值分解和矩阵间分解时,计算出一个符合相应的 PDE 解析的确定性矩阵草图可能非常昂贵。在计算单值分解和矩阵间解析时,该工程中的确定性草图方法比随机的草图有很多优势。广义的随机草图是数据,而拟议的草图方法则是利用复杂 PDE 系统内生成的数据结构。这些确定性草图往往更快, 需要获得一个小部分的粗略表示, 与对应的粗略表达值的矩阵图, 并且需要一种精确的精确的数值, 最后的算方法是, 。