与通用线性线性模型一起学习高斯语混合体:高二层中精确的症状 (Learning Gaussian Mixtures with Generalised Linear Models: Precise Asymptotics in High-dimensions)

Generalised linear models for multi-class classification problems are one of the fundamental building blocks of modern machine learning tasks. In this manuscript, we characterise the learning of a mixture of $K$ Gaussians with generic means and covariances via empirical risk minimisation (ERM) with any convex loss and regularisation. In particular, we prove exact asymptotics characterising the ERM estimator in high-dimensions, extending several previous results about Gaussian mixture classification in the literature. We exemplify our result in two tasks of interest in statistical learning: a) classification for a mixture with sparse means, where we study the efficiency of $\ell_1$ penalty with respect to $\ell_2$; b) max-margin multi-class classification, where we characterise the phase transition on the existence of the multi-class logistic maximum likelihood estimator for $K>2$. Finally, we discuss how our theory can be applied beyond the scope of synthetic data, showing that in different cases Gaussian mixtures capture closely the learning curve of classification tasks in real data sets.

翻译：多级分类问题的一般线性模型是现代机器学习任务的基本基石之一。在本手稿中,我们描述了通过实验风险最小化(ERM)和任何二次曲线损失和规范化,以通用手段和共差方式学习混合的Gaussian $K$和Gaussian的通用手段和共差。特别是,我们证明在高分层中将机构风险管理估计值描述为高分层,扩展了文献中先前关于高斯混合分类的若干结果。我们举例说明了我们在统计学习中感兴趣的两项任务的结果:(a) 将一种稀有手段的混合物进行分类,我们研究的是美元/ell_1美元的罚款对美元/ell_2美元的效率;(b) 最大间距多级分类,我们描述多级物流估计值存在最高可能性值($>2美元)的阶段过渡。最后,我们讨论了我们的理论如何超越合成数据的范围加以应用,表明在不同情况下高斯混合物能密切捕捉到真实数据集中分类任务的学习曲线。