In this paper, we study Wasserstein Generative Adversarial Networks (WGANs) using GroupSort neural networks as discriminators. We show that the error bound for the approximation of target distribution depends on both the width/depth (capacity) of generators and discriminators, as well as the number of samples in training. A quantified generalization bound is established for Wasserstein distance between the generated distribution and the target distribution. According to our theoretical results, WGANs have higher requirement for the capacity of discriminators than that of generators, which is consistent with some existing theories. More importantly, overly deep and wide (high capacity) generators may cause worse results (after training) than low capacity generators if discriminators are not strong enough. Numerical results on the synthetic data (swiss roll) and MNIST data confirm our theoretical results, and demonstrate that the performance by using GroupSort neural networks as discriminators is better than that of the original WGAN.
翻译:在本文中,我们用GroupSort神经网络(WGANs)作为歧视者来研究瓦塞尔斯坦基因突变网络(WGANs),我们用GroupSort神经网络(GroupSort Enalment Aversarial Networks)作为歧视者来研究瓦塞尔斯坦基因突变网络(WGANs),我们发现,目标分布近似的误差取决于发电机和导体的宽度/深度(能力)以及培训中的样本数量,对瓦塞斯坦射电分布和目标分布之间的距离规定了量化的通用约束。根据我们的理论结果,WGANs对歧视者的能力的要求高于与某些现有理论相一致的发电机的能力。 更重要的是,如果歧视者不够强大,过度深度和广度(高容量)的发电机可能会造成比低容量发电机更坏的结果(培训后),如果歧视者不够强大的话。 合成数据(swis滚)的数值结果和MNIST数据证实了我们的理论结果,并表明利用GromSort神经网络作为歧视者的表现比原WGAN要好。