We consider the problem of learning a latent $k$-vertex simplex $K\subset\mathbb{R}^d$, given access to $A\in\mathbb{R}^{d\times n}$, which can be viewed as a data matrix whose $n$ columns are obtained by randomly perturbing latent points in the simplex $K$ (potentially beyond $K$). A large class of latent variable models, such as adversarial clustering, mixed membership stochastic block models, and topic models, can be cast as learning a latent simplex. Bhattacharyya and Kannan (SODA 2020) give an algorithm for learning such a latent simplex in time roughly $O(k\cdot\textrm{nnz}(A))$, where $\textrm{nnz}(A)$ is the number of non-zero entries of $A$. We show that the dependence on $k$ in the running time is unnecessary, given a natural assumption about the mass of the top $k$ singular values of $A$ that holds in many of these applications. Further, we show this assumption is necessary: without it, an algorithm for learning a latent simplex would imply an algorithmic breakthrough for spectral low-rank approximation. At a high level, Bhattacharyya and Kannan provide an adaptive algorithm that makes $k$ matrix-vector product queries to $A$, where each query is a function of all preceding queries. Since each matrix-vector product takes $O(\textrm{nnz}(A))$ time, their overall running time appears unavoidable. Instead, we obtain a low-rank approximation to $A$ in input-sparsity time and show that the column space thus obtained has small $\sin\Theta$ (angular) distance to the top-$k$ left singular space of $A$. Our algorithm then selects $k$ points in the low-rank subspace, each maximizing the inner product with one of $k$ carefully chosen random vectors. By working in the low-rank subspace, we avoid reading the entire matrix in each iteration and thus circumvent the $\Theta(k\cdot\textrm{nnz}(A))$ running time.
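To make the two-step approach concrete, the following is a minimal NumPy/SciPy sketch of the idea: a CountSketch-style sparse embedding yields, in one pass over $A$, a basis whose span approximates the top-$k$ column space, and candidate vertices are then chosen by maximizing inner products with random directions inside that subspace. The names `countsketch` and `learn_latent_simplex`, the `oversample` parameter, and the one-shot dense projection onto the basis are illustrative simplifications, not the paper's algorithm, and this sketch does not reproduce the paper's exact running-time accounting.

```python
import numpy as np
import scipy.sparse as sp

def countsketch(m, n, rng):
    """CountSketch-style sparse embedding S in R^{m x n}: each column has a
    single random +-1 entry, so S @ x costs O(nnz(x)) to apply."""
    rows = rng.integers(0, m, size=n)
    cols = np.arange(n)
    signs = rng.choice([-1.0, 1.0], size=n)
    return sp.csr_matrix((signs, (rows, cols)), shape=(m, n))

def learn_latent_simplex(A, k, oversample=10, seed=0):
    """Illustrative two-step procedure: (1) randomized low-rank basis,
    (2) pick k columns maximizing inner products with random directions
    drawn inside the low-rank subspace. A is a dense d x n ndarray."""
    rng = np.random.default_rng(seed)
    d, n = A.shape
    m = min(k + oversample, n)

    # Step 1: input-sparsity-style sketch of the column space.
    S = countsketch(m, n, rng)            # m x n sparse embedding
    Y = (S @ A.T).T                       # d x m; one pass over A
    Q, _ = np.linalg.qr(Y)                # orthonormal basis ~ top-k column space

    # Reduced coordinates of the data points (computed once, for clarity;
    # the paper is more careful about the cost of this step).
    C = Q.T @ A                           # m x n

    # Step 2: k random-direction maximizations inside the subspace.
    picked = []
    for _ in range(k):
        r = rng.standard_normal(m)        # random direction in reduced coords
        scores = r @ C                    # O(mn) per iteration, no pass over A
        scores[picked] = -np.inf          # avoid re-picking the same column
        picked.append(int(np.argmax(scores)))
    return A[:, picked], picked           # candidate simplex vertices

# Toy usage: k = 3 latent vertices in R^40, points are noisy convex combinations.
rng = np.random.default_rng(1)
V = rng.standard_normal((40, 3))                    # true vertices
W = rng.dirichlet(np.ones(3) * 0.2, size=5000).T    # 3 x 5000 mixing weights
A = V @ W + 0.01 * rng.standard_normal((40, 5000))  # perturbed data matrix
V_hat, idx = learn_latent_simplex(A, k=3)
print(idx, V_hat.shape)                              # chosen columns, (40, 3)
```

The Dirichlet weights with small concentration parameter push points toward the simplex vertices, so the random-direction maximizations tend to recover columns near distinct vertices; the per-iteration work is $O(mn)$ in the reduced coordinates rather than a full $\textrm{nnz}(A)$ pass.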