Generative adversarial networks (GANs) have proven effective in modeling distributions of high-dimensional data. However, their training instability is a well-known hindrance to convergence, which results in practical challenges in their applications to novel data. Furthermore, even when convergence is reached, GANs can be affected by mode collapse, a phenomenon for which the generator learns to model only a small part of the target distribution, disregarding the vast majority of the data manifold or distribution. This paper addresses these challenges by introducing SetGAN, an adversarial architecture that processes sets of generated and real samples, and discriminates between the origins of these sets (i.e., training versus generated data) in a flexible, permutation invariant manner. We also propose a new metric to quantitatively evaluate GANs that does not require previous knowledge of the application, apart from the data itself. Using the new metric, in conjunction with the state-of-the-art evaluation methods, we show that the proposed architecture, when compared with GAN variants stemming from similar strategies, produces more accurate models of the input data in a way that is also less sensitive to hyperparameter settings.
翻译:模拟性对抗网络(GANs)在模拟高维数据分布方面证明是行之有效的,然而,它们的训练不稳定是众所周知的阻碍趋同的障碍,因此在应用新数据方面造成了实际的挑战。此外,即使达到趋同,GANs也可能受到模式崩溃的影响,对于这种现象,发电机学会只模拟目标分布的一小部分,而忽略了绝大多数数据的多样性或分布。本文件通过引入SetGAN来应对这些挑战,SEGAN是一个处理成套生成和真实样本的对立结构,以灵活和变异的方式区分这些数据集的起源(即培训与生成的数据)。我们还提出了一个新的指标,对GANs进行定量评估,除了数据本身之外,不需要事先了解应用情况。我们利用新的指标,与最新评价方法一起,表明拟议的结构与来自类似战略的GAN变异体相比,以对超光度设置不那么敏感的方式生成更精确的输入数据模型。