We consider the problem of estimating the parameters a Gaussian Mixture Model with K components of known weights, all with an identity covariance matrix. We make two contributions. First, at the population level, we present a sharper analysis of the local convergence of EM and gradient EM, compared to previous works. Assuming a separation of $\Omega(\sqrt{\log K})$, we prove convergence of both methods to the global optima from an initialization region larger than those of previous works. Specifically, the initial guess of each component can be as far as (almost) half its distance to the nearest Gaussian. This is essentially the largest possible contraction region. Our second contribution are improved sample size requirements for accurate estimation by EM and gradient EM. In previous works, the required number of samples had a quadratic dependence on the maximal separation between the K components, and the resulting error estimate increased linearly with this maximal separation. In this manuscript we show that both quantities depend only logarithmically on the maximal separation.
翻译:我们考虑了用已知重量的 K 元素来估算高斯混血模型参数的问题。 我们做了两种贡献。 首先, 在人口层面, 我们比以往的工程对EM和梯度EM的本地趋同性进行了更精确的分析。 假设用$\ omega (\ sqrt ~log K}) 进行分离, 我们证明这两种方法都与全球奥地马的初始化区域比以往的工程大。 具体地说, 每个元素的初始猜测可以远到( 几乎) 其距离最近的高斯的一半。 这基本上是最大的收缩区域。 我们的第二个贡献是改进了EM 和 梯度 EM 准确估算的样本大小要求。 在以往的工程中, 所需的样本数量对K 组件之间的最大分母值的依赖度, 以及由此得出的错误估计值随着这一最大分数而增加的线性。 我们的手稿显示, 两者的数量都仅取决于最高分数。