Wasserstein GANs (WGANs), built upon the Kantorovich-Rubinstein (KR) duality of Wasserstein distance, is one of the most theoretically sound GAN models. However, in practice it does not always outperform other variants of GANs. This is mostly due to the imperfect implementation of the Lipschitz condition required by the KR duality. Extensive work has been done in the community with different implementations of the Lipschitz constraint, which, however, is still hard to satisfy the restriction perfectly in practice. In this paper, we argue that the strong Lipschitz constraint might be unnecessary for optimization. Instead, we take a step back and try to relax the Lipschitz constraint. Theoretically, we first demonstrate a more general dual form of the Wasserstein distance called the Sobolev duality, which relaxes the Lipschitz constraint but still maintains the favorable gradient property of the Wasserstein distance. Moreover, we show that the KR duality is actually a special case of the Sobolev duality. Based on the relaxed duality, we further propose a generalized WGAN training scheme named Sobolev Wasserstein GAN (SWGAN), and empirically demonstrate the improvement of SWGAN over existing methods with extensive experiments.
翻译:瓦西斯坦GANs(WGANs)建立在瓦西斯坦距离的Kantorovich-Rubinstein(KR)双轨制之上,是理论上最健全的GAN模式之一,但实际上,它并不总是优于GANs的其他变异,这主要是因为KR双轨制所要求的利普施维茨条件的落实不完善。在社区中,Lipschitz限制的落实情况各不相同,但这种限制仍然难以完全满足。在本文件中,我们认为,强大的利普西茨限制对优化来说可能是不必要的。相反,我们退后一步,试图放松利普施维茨限制。理论上,我们首先展示了瓦西斯坦距离(Sobolevlix)的更普遍的双重形式,即苏博利茨双轨制,这缓解了利普西茨的制约,但仍然保持了利普西茨距离的有利梯度。此外,我们表明,KR双轨制实际上是索博列夫双轨制的一个特殊案例。基于宽松的双轨制,我们进一步提出了普遍改进的双轨制,我们进一步展示了目前以SWWGAN系统取代SOLVAN系统。