We give a polynomial-time algorithm for the problem of robustly estimating a mixture of $k$ arbitrary Gaussians in $\mathbb{R}^d$, for any fixed $k$, in the presence of a constant fraction of arbitrary corruptions. This resolves the main open problem in several previous works on algorithmic robust statistics, which addressed the special cases of robustly estimating (a) a single Gaussian, (b) a mixture of TV-distance separated Gaussians, and (c) a uniform mixture of two Gaussians. Our main tools are an efficient \emph{partial clustering} algorithm that relies on the sum-of-squares method, and a novel tensor decomposition algorithm that allows errors in both Frobenius norm and low-rank terms.
翻译:我们给出了一种多元时间算法, 解决了强力估算(a) 单一高斯,(b) 电视和距离分离高斯的混合,(c) 两个高斯人的统一混合体。 我们的主要工具是高效的 emph{部分组合制算法,依赖于方对方之和法,以及一种允许弗罗贝尼乌斯常规和低级术语出现错误的新型高尔夫分解算法。