Low-rank tensor decomposition generalizes low-rank matrix approximation and is a powerful technique for discovering low-dimensional structure in high-dimensional data. In this paper, we study Tucker decompositions and use tools from randomized numerical linear algebra called ridge leverage scores to accelerate the core tensor update step in the widely-used alternating least squares (ALS) algorithm. Updating the core tensor, a severe bottleneck in ALS, is a highly-structured ridge regression problem where the design matrix is a Kronecker product of the factor matrices. We show how to use approximate ridge leverage scores to construct a sketched instance for any ridge regression problem such that the solution vector for the sketched problem is a $(1+\varepsilon)$-approximation to the original instance. Moreover, we show that classical leverage scores suffice as an approximation, which then allows us to exploit the Kronecker structure and update the core tensor in time that depends predominantly on the rank and the sketching parameters (i.e., sublinear in the size of the input tensor). We also give upper bounds for ridge leverage scores as rows are removed from the design matrix (e.g., if the tensor has missing entries), and we demonstrate the effectiveness of our approximate ridge regressioni algorithm for large, low-rank Tucker decompositions on both synthetic and real-world data.
翻译:在本文中,我们研究塔克分解,并使用随机数字线性线性代数(ridge 杠杆杠杆评分)的工具,以加速广泛使用的交替最小方(ALS)算法中的核心振动更新步骤。更新核心振动,这是ALS中一个严重的瓶颈,是一个高度结构化的脊柱回归问题,设计矩阵是要素矩阵的克罗涅克产物。我们展示了如何使用近似脊脊杠杆评分来构建一个草图式实例来应对任何峰值回归问题,例如,用于素描问题的解决方案矢量为$(1 ⁇ varepsilon) 和美元对原例的认可。此外,我们显示经典杠杆计数足以作为近似,从而使我们能够利用Kronecker结构,并在时间上更新核心峰值,这主要取决于因素矩阵的等级和素描参数(i.e., 即精度的次线性山脊杠杆值, 用于从实际输入 stromor 的大小, 和 IMFlickral 上展示了我们所缺的磁带的 。