We consider a symmetric mixture of linear regressions with random samples from the pairwise comparison design, which can be seen as a noisy version of a type of Euclidean distance geometry problem. We analyze the expectation-maximization (EM) algorithm locally around the ground truth and establish that the sequence converges linearly, providing an $\ell_\infty$-norm guarantee on the estimation error of the iterates. Furthermore, we show that the limit of the EM sequence achieves the sharp rate of estimation in the $\ell_2$-norm, matching the information-theoretically optimal constant. We also argue through simulation that convergence from a random initialization is much more delicate in this setting, and does not appear to occur in general. Our results show that the EM algorithm can exhibit several unique behaviors when the covariate distribution is suitably structured.
翻译:我们认为,线性回归是线性回归的对称混合物,其样本来自对称比较设计,这可以被视为一种欧洲大陆远距离几何问题的杂音版本。我们分析了地面周围的预期-最大化算法(EM),确定序列线性趋同,为迭代的估计误差提供了$@ ⁇ infty$-norm的保证。此外,我们显示EM序列的极限达到了$@ell_2$-norm的急剧估计率,与信息-理论最佳常数相匹配。我们还通过模拟论证,随机初始化的趋同在这一环境中更为微妙,而且似乎并不普遍。我们的结果表明,当共变式分布结构适当时,EM算法可以显示几种独特的行为。