Optimal transport maps between two probability distributions $\mu$ and $\nu$ on $\mathbb{R}^d$ have found extensive applications in both machine learning and statistics. In practice, these maps need to be estimated from data sampled according to $\mu$ and $\nu$. Plug-in estimators are perhaps most popular in estimating transport maps in the field of computational optimal transport. In this paper, we provide a comprehensive analysis of the rates of convergences for general plug-in estimators defined via barycentric projections. Our main contribution is a new stability estimate for barycentric projections which proceeds under minimal smoothness assumptions and can be used to analyze general plug-in estimators. We illustrate the usefulness of this stability estimate by first providing rates of convergence for the natural discrete-discrete and semi-discrete estimators of optimal transport maps. We then use the same stability estimate to show that, under additional smoothness assumptions of Besov type or Sobolev type, wavelet based or kernel smoothed plug-in estimators respectively speed up the rates of convergence and significantly mitigate the curse of dimensionality suffered by the natural discrete-discrete/semi-discrete estimators. As a by-product of our analysis, we also obtain faster rates of convergence for plug-in estimators of $W_2(\mu,\nu)$, the Wasserstein distance between $\mu$ and $\nu$, under the aforementioned smoothness assumptions, thereby complementing recent results in Chizat et al. (2020). Finally, we illustrate the applicability of our results in obtaining rates of convergence for Wasserstein barycenters between two probability distributions and obtaining asymptotic detection thresholds for some recent optimal-transport based tests of independence.
翻译:在计算最佳运输领域,最佳估算器在估算运输图时可能最受欢迎。在本文中,我们对通过巴里中心预测定义的普通缓冲和估测器的趋同率进行了全面分析。我们的主要贡献是,在最平滑的假设下,根据最平滑的假设,在机器学习和统计中,对鲁莽的预测进行新的稳定性估计。在实际中,这些地图需要根据按美元和美元进行抽样的数据来估算。我们首先为计算最佳运输领域的自然离散和半分解的估测器提供最佳运输图的趋同率,从而展示了这种稳定性估计的有用性。我们用同样的稳定性估计来显示,根据Besov 类型或Sobolev 类型的额外平稳假设,以波列或内流为基,在最平滑的巴里预测中,在最低平滑的假设下,在最平滑的中间,可以用来分析普通的缓冲结果。我们最近温度和最直径直径直的中间,通过直径直立的内压的内压的内压速度,我们通过直径直径直径直压的内测测测测测测测测得的精度,我们最近压的压的压的正的正的正的压结果。