The geometric median, a notion of center for multivariate distributions, has gained recent attention in robust statistics and machine learning. Although conceptually distinct from the mean (i.e., expectation), we demonstrate that both are very close in high dimensions when the dependence between the distribution components is suitably controlled. Concretely, we find an upper bound on the distance that vanishes with the dimension asymptotically, and derive a rate-matching first order expansion of the geometric median components. Simulations illustrate and confirm our results.
翻译:几何中位数作为多元分布的中心概念,近年来在稳健统计学和机器学习中受到关注。尽管其概念与均值(即期望)不同,但我们证明,当分布分量之间的依赖性得到适当控制时,两者在高维空间中非常接近。具体而言,我们找到了一个距离的上界,该上界随维度渐近趋于零,并推导出几何中位数分量的速率匹配一阶展开式。仿真实验验证并说明了我们的结果。