计算瓦塞斯泰因百利中心商的快速全球线性一致算法 (A Fast Globally Linearly Convergent Algorithm for the Computation of Wasserstein Barycenters)

We consider the problem of computing a Wasserstein barycenter for a set of discrete probability distributions with finite supports, which finds many applications in areas such as statistics, machine learning and image processing. When the support points of the barycenter are pre-specified, this problem can be modeled as a linear programming (LP) problem whose size can be extremely large. To handle this large-scale LP, we analyse the structure of its dual problem, which is conceivably more tractable and can be reformulated as a well-structured convex problem with 3 kinds of block variables and a coupling linear equality constraint. We then adapt a symmetric Gauss-Seidel based alternating direction method of multipliers (sGS-ADMM) to solve the resulting dual problem and establish its global convergence and global linear convergence rate. As a critical component for efficient computation, we also show how all the subproblems involved can be solved exactly and efficiently. This makes our method suitable for computing a Wasserstein barycenter on a large-scale data set, without introducing an entropy regularization term as is commonly practiced. In addition, our sGS-ADMM can be used as a subroutine in an alternating minimization method to compute a barycenter when its support points are not pre-specified. Numerical results on synthetic data sets and image data sets demonstrate that our method is highly competitive for solving large-scale Wasserstein barycenter problems, in comparison to two existing representative methods and the commercial software Gurobi.

翻译：我们考虑用一定支持来计算一组离散概率分布的瓦塞斯坦中点的问题, 后者在统计、机器学习和图像处理等领域有许多应用。当预指定了该中点的支持点时, 这个问题可以模拟成一个其大小可能非常大的线性编程问题。要处理这个大型 LP, 我们分析其双重问题的结构, 这个问题可以想象地更可移植, 并且可以重拟成一个结构完善的、具有3类轮式变量和线性平等制约等结构完善的合成软体问题。然后我们调整一个基于乘数交替方向方法的对称高尔斯- 赛德尔( GS- ADMM), 以解决由此产生的双重问题, 并确定其全球趋同和全球线性趋同率。作为高效计算的关键组成部分, 我们还要展示它所涉及的所有子问题是如何准确和有效地解决的。这使我们的方法适合在大型数据集上计算一个高额的瓦塞斯坦软质性软质质, 并同时不引入一个基于正态正态正态的正统性正统化调术语, 作为普通的正统方法, 。当我们的正统方法使用时, 我们的正统性的正统性将它作为一个普通的正统的正统数据, 。