瓦西尔斯泰因GANs 带有逐步惩罚加分惩罚的计算分流运输 (Wasserstein GANs with Gradient Penalty Compute Congested Transport)

Wasserstein GANs with Gradient Penalty (WGAN-GP) are an extremely popular method for training generative models to produce high quality synthetic data. While WGAN-GP were initially developed to calculate the Wasserstein 1 distance between generated and real data, recent works (e.g. Stanczuk et al. (2021)) have provided empirical evidence that this does not occur, and have argued that WGAN-GP perform well not in spite of this issue, but because of it. In this paper we show for the first time that WGAN-GP compute the minimum of a different optimal transport problem, the so-called congested transport (Carlier et al. (2008)). Congested transport determines the cost of moving one distribution to another under a transport model that penalizes congestion. For WGAN-GP, we find that the congestion penalty has a spatially varying component determined by the sampling strategy used in Gulrajani et al. (2017) which acts like a local speed limit, making congestion cost less in some regions than others. This aspect of the congested transport problem is new in that the congestion penalty turns out to be unbounded and depend on the distributions to be transported, and so we provide the necessary mathematical proofs for this setting. We use our discovery to show that the gradients of solutions to the optimization problem in WGAN-GP determine the time averaged momentum of optimal mass flow. This is in contrast to the gradients of Kantorovich potentials for the Wasserstein 1 distance, which only determine the normalized direction of flow. This may explain, in support of Stanczuk et al. (2021), the success of WGAN-GP, since the training of the generator is based on these gradients.

翻译：瓦塞斯坦GANs与GARIEDEN(WGAN-GP)是培训基因模型以产生高质量合成数据的极受欢迎的方法。虽然最初开发WGAN-GP是为了计算生成数据与真实数据之间的瓦塞斯坦1距离,但最近的工作(例如斯坦丘克等人(2021年))提供了经验证据,证明没有发生这种情况,并声称WGAN-GP在这一点上表现良好,但因为这个问题,所以情况并非如此。在本文中,我们第一次显示,WGAN-GP在计算不同最佳运输问题的最低值时,即所谓的凝固运输(Carlier等人(2008年) 。虽然最初开发了WGAN-GP,以计算瓦塞斯坦1号数据与真实数据之间的距离,但是,最近的工作(例如斯坦丘克等人(2021年))提供了经验性证据,证明这种情况没有发生,而且由于格拉贾尼等人(2017年)的取样战略,因此情况并不那么好,因此在某些区域造成拥堵塞成本比其他区域要低一些。在凝固的运输问题上,这种卡路变变变的运输问题是新的,这种螺流流流趋势,从1号流流流到SAL的推算到我们需要的螺旋,从SOL的稳定性的流流流,我们开始,从而决定了。