This work resolves the following question in non-Euclidean statistics: Is it possible to consistently estimate the Fr\'echet mean set of an unknown population distribution, with respect to the Hausdorff metric, when given access to independent identically-distributed samples? Our affirmative answer is based on a careful analysis of the ``relaxed empirical Fr\'echet mean set estimators'' which identify the set of near-minimizers of the empirical Fr\'echet functional and where the amount of ``relaxation'' vanishes as the number of data tends to infinity. Our main theoretical results include exact descriptions of which relaxation rates give weak consistency and which give strong consistency, as well as the construction of a ``two-step estimator'' which (assuming only the finiteness of certain moments and a mild condition on the metric entropy of the underlying metric space) adaptively finds the fastest possible relaxation rate for strongly consistent estimation. Our main practical result is simply that researchers working with non-Euclidean data in the real world can be better off computing relaxed empirical Fr\'echet mean sets rather than unrelaxed empirical Fr\'echet mean sets.
翻译:这项工作解决了非欧洲的统计中的下列问题:在允许获得独立的相同分布样本时,能否就Hausdorf 衡量标准一致地估算Fr\'echche 平均人口分布的未知值集?我们肯定的答案是基于对“经放松的经验性Fr\'echet 平均估计值”的仔细分析,该分析确定经验性Fr\'echet功能的近最小值组,以及当“放松”的消失量随着数据数量趋于无限时。我们的主要理论结果包括准确描述哪些放松率使一致性弱,并具有很强的连贯性;以及构建“两步定数”的“估量器”(仅假设某些时刻的有限性以及基础物理空间的矩阵的温和条件),以适应性的方式发现最快速的放松率,以便进行非常一致的估计。我们的主要实际结果是,在现实世界中从事非欧洲数据研究的研究人员可以更好地计算宽松的经验性Frrchemet,而不是不折reching 。