Projection robust Wasserstein (PRW) distance, or Wasserstein projection pursuit (WPP), is a robust variant of the Wasserstein distance. Recent work suggests that this quantity is more robust than the standard Wasserstein distance, in particular when comparing probability measures in high-dimensions. However, it is ruled out for practical application because the optimization model is essentially non-convex and non-smooth which makes the computation intractable. Our contribution in this paper is to revisit the original motivation behind WPP/PRW, but take the hard route of showing that, despite its non-convexity and lack of nonsmoothness, and even despite some hardness results proved by~\citet{Niles-2019-Estimation} in a minimax sense, the original formulation for PRW/WPP \textit{can} be efficiently computed in practice using Riemannian optimization, yielding in relevant cases better behavior than its convex relaxation. More specifically, we provide three simple algorithms with solid theoretical guarantee on their complexity bound (one in the appendix), and demonstrate their effectiveness and efficiency by conducing extensive experiments on synthetic and real data. This paper provides a first step into a computational theory of the PRW distance and provides the links between optimal transport and Riemannian optimization.
翻译:瓦森斯坦(PRW)的射线强强瓦森斯坦(Wasserstein)的距离,或瓦森斯坦(Wasserstein)的投影追踪(WPP),是瓦森斯坦距离的强有力变体。最近的工作表明,这一数量比标准的瓦森斯坦距离强,特别是在比较高二兆值的概率测量值时。然而,由于优化模型基本上不是康韦克斯和非斯摩特的,因此计算难以计算,因此排除了实际应用。我们在本文件中的贡献是重新审视WPP/PRW的原始动机,但采取了强硬的路线,表明尽管它不和谐,缺乏不光滑,甚至尽管它比标准的瓦克斯斯坦距离要强一些。 最近的工作表明,这一数量比标准的瓦克斯(citet{Niles-2019-Estimation)的距离要强一些,特别是在比较高。但是,在小麦克斯意义上,PRW/WP/WPP textitit{can}最初的配制式设计方法在实际操作中得到了有效的计算,在相关情况下比其调情调放松。更优。更具体地说,我们提供了三种简单的算法,在复杂的纸张上提供了一种最优化的模型和最优化的模型上提供了一种最佳的模型。