We consider the sparse moment problem of learning a $k$-spike mixture in high dimensional space from its noisy moment information in any dimension. We measure the accuracy of the learned mixtures using transportation distance. Previous algorithms either assume certain separation assumptions, use more recovery moments, or run in (super) exponential time. Our algorithm for the 1-dimension problem (also called the sparse Hausdorff moment problem) is a robust version of the classic Prony's method, and our contribution mainly lies in the analysis. We adopt a global and much tighter analysis than previous work (which analyzes the perturbation of the intermediate results of Prony's method). A useful technical ingredient is a connection between the linear system defined by the Vandermonde matrix and the Schur polynomial, which allows us to provide tight perturbation bound independent of the separation and may be useful in other contexts. To tackle the high dimensional problem, we first solve the 2-dimensional problem by extending the 1-dimension algorithm and analysis to complex numbers. Our algorithm for the high dimensional case determines the coordinates of each spike by aligning a 1-d projection of the mixture to a random vector and a set of 2d-projections of the mixture. Our results have applications to learning topic models and Gaussian mixtures, implying improved sample complexity results or running time over prior work.
翻译:我们认为,从高维空间从任何维度的噪音瞬间信息中学习$k$-spike混合物,是一个稀疏的瞬间问题。我们用运输距离测量所学混合物的准确性。以前的算法要么假设某些分离假设,使用更多的恢复瞬间,要么在(超级)指数时间运行。我们对一二分解问题的算法(也称为稀疏Hausdorf瞬间问题)是典型的Prony方法的可靠版本,我们的贡献主要在于分析。我们采用了比以往工作(分析Prony方法中间结果的渗透性)更严密的全球分析。一个有用的技术成分是Vandermonde矩阵定义的线性系统与Schur 多元分子(超强) 之间的连接,它使我们能够提供独立于分离的紧凑的扰动性透度(也称为Hausdorform的问题) 。为了解决高维度问题,我们首先将一二分解算法和分析扩大到复杂的数字。我们高维基案例的算法通过调整一个峰值的坐标坐标,将我们之前的模型的模型的模型和升级的混合物的模型的组合结果的随机投影判。