This paper studies the estimation of large-scale optimal transport maps (OTM), which is a well-known challenging problem owing to the curse of dimensionality. Existing literature approximates the large-scale OTM by a series of one-dimensional OTM problems through iterative random projection. Such methods, however, suffer from slow or none convergence in practice due to the nature of randomly selected projection directions. Instead, we propose an estimation method of large-scale OTM by combining the idea of projection pursuit regression and sufficient dimension reduction. The proposed method, named projection pursuit Monge map (PPMM), adaptively selects the most ``informative'' projection direction in each iteration. We theoretically show the proposed dimension reduction method can consistently estimate the most ``informative'' projection direction in each iteration. Furthermore, the PPMM algorithm weakly convergences to the target large-scale OTM in a reasonable number of steps. Empirically, PPMM is computationally easy and converges fast. We assess its finite sample performance through the applications of Wasserstein distance estimation and generative models.
翻译:本文研究大规模最佳运输图(OTM)的估算,这是一个众所周知的具有挑战性的问题,这是由于维度的诅咒造成的一个众所周知的难题。现有文献通过迭代随机投影,通过一系列单维的OTM问题,将大型OTM近似于一维的OTM问题。然而,由于随机选择的投影方向的性质,这些方法在实践中缓慢或完全没有趋同。相反,我们提出大规模OTM的估计方法,将投影回归和充分减少尺寸的概念结合起来。拟议的方法,称为投影追逐蒙古地图(PPMM),适应性地选择了每种迭代中最“信息化”的预测方向。我们理论上表明拟议的尺寸削减方法可以一致估计每个迭代中最“信息化”的预测方向。此外,PMM的算法在合理数量的步骤中与目标大型OTM的趋同较弱的趋同。PMM的算法容易计算,并很快趋于一致。我们通过应用瓦斯特斯坦距离估计和基因化模型来评估其有限的样品性。