In Euclidean spaces, the empirical mean vector as an estimator of the population mean is known to have polynomial concentration unless a strong tail assumption is imposed on the underlying probability measure. The idea of median-of-means tournament has been considered as a way of overcoming the sub-optimality of the empirical mean vector. In this paper, to address the sub-optimal performance of the empirical mean in a more general setting, we consider general Polish spaces with a general metric, which are allowed to be non-compact and of infinite-dimension. We discuss the estimation of the associated population Frechet mean, and for this we extend the existing notion of median-of-means to this general setting. We devise several new notions and inequalities associated with the geometry of the underlying metric, and using them we study the concentration properties of the extended notions of median-of-means as the estimators of the population Frechet mean. We show that the new estimators achieve exponential concentration under only a second moment condition on the underlying distribution, while the empirical Frechet mean has polynomial concentration. We focus our study on spaces with non-positive Alexandrov curvature since they afford slower rates of convergence than spaces with positive curvature. We note that this is the first work that derives non-asymptotic concentration inequalities for extended notions of the median-of-means in non-vector spaces with a general metric.
翻译:在欧几里德空间,经验中值矢量作为人口中位值的测算标准已知具有多角集中度,除非对基本概率的测算施加强烈的尾端假设。中值中位量比赛的想法被视为克服实验中位量矢量的亚最佳性的一种方法。在本文中,为了在更笼统的环境下处理经验中位值的亚最佳性表现,我们认为波兰一般空间具有一般度量,允许不相容和无限分化。我们讨论相关人口Frechet平均值的估计,为此我们将现有的中位值中位值概念扩大到这一总体环境。我们设计了一些与基本度量度的几何性差相关的新概念和不平等。我们研究了中位值中位值作为人口估计值平均值的扩展的集中性特性。我们发现,新的估计值在基本分布的第二个条件下达到指数性集中度,而实验中位法利切值中位值中位值的非中位值中位值则意味着这种非正位值的常态浓度。我们的研究重点是,自亚历山大以来,这种中位值中位值的正位值是非正位值的正位值。