Optimal Transport (OT) has attracted significant interest in the machine learning community, not only for its ability to define meaningful distances between probability distributions -- such as the Wasserstein distance -- but also for its formulation of OT plans. Its computational complexity remains a bottleneck, though, and slicing techniques have been developed to scale OT to large datasets. Recently, a novel slicing scheme, dubbed min-SWGG, lifts a single one-dimensional plan back to the original multidimensional space, finally selecting the slice that yields the lowest Wasserstein distance as an approximation of the full OT plan. Despite its computational and theoretical advantages, min-SWGG inherits typical limitations of slicing methods: (i) the number of required slices grows exponentially with the data dimension, and (ii) it is constrained to linear projections. Here, we reformulate min-SWGG as a bilevel optimization problem and propose a differentiable approximation scheme to efficiently identify the optimal slice, even in high-dimensional settings. We furthermore define its generalized extension for accommodating to data living on manifolds. Finally, we demonstrate the practical value of our approach in various applications, including gradient flows on manifolds and high-dimensional spaces, as well as a novel sliced OT-based conditional flow matching for image generation -- where fast computation of transport plans is essential.
翻译:最优传输(OT)在机器学习领域引起了广泛关注,这不仅源于其定义概率分布间有意义距离的能力——例如Wasserstein距离——还在于其OT规划的数学表述。然而,其计算复杂度仍是瓶颈,切片技术因此被开发用于将OT扩展至大规模数据集。近期,一种称为min-SWGG的新型切片方案将一维规划提升回原始多维空间,最终选择产生最小Wasserstein距离的切片作为完整OT规划的近似。尽管具有计算和理论优势,min-SWGG仍继承了切片方法的典型局限:(i)所需切片数量随数据维度呈指数增长;(ii)受限于线性投影。本文中,我们将min-SWGG重新表述为双层优化问题,并提出一种可微近似方案以高效识别最优切片,即使在高维场景下亦能适用。此外,我们定义了其广义扩展形式,以适应流形上的数据分布。最后,我们通过多种应用验证了所提方法的实用价值,包括流形与高维空间上的梯度流,以及一种基于切片OT的新型条件流匹配图像生成方法——其中传输规划的快速计算至关重要。