Generative Adversarial Networks (GANs) are formulated as minimax game problems, whereby generators attempt to approach real data distributions by virtue of adversarial learning against discriminators. The intrinsic problem complexity poses the challenge to enhance the performance of generative networks. In this work, we aim to boost model learning from the perspective of network architectures, by incorporating recent progress on automated architecture search into GANs. To this end, we propose a fully differentiable search framework for generative adversarial networks, dubbed alphaGAN. The searching process is formalized as solving a bi-level minimax optimization problem, in which the outer-level objective aims for seeking a suitable network architecture towards pure Nash Equilibrium conditioned on the generator and the discriminator network parameters optimized with a traditional GAN loss in the inner level. The entire optimization performs a first-order method by alternately minimizing the two-level objective in a fully differentiable manner, enabling architecture search to be completed in an enormous search space. Extensive experiments on CIFAR-10 and STL-10 datasets show that our algorithm can obtain high-performing architectures only with 3-GPU hours on a single GPU in the search space comprised of approximate 2 ? 1011 possible configurations. We also provide a comprehensive analysis on the behavior of the searching process and the properties of searched architectures, which would benefit further research on architectures for generative models. Pretrained models and codes are available at https://github.com/yuesongtian/AlphaGAN.
翻译:Adversarial Networks(GANs)是作为小型游戏问题而设计的,使发电机试图通过对歧视者的对抗性学习来处理真实的数据分配问题。内在问题的复杂性对提高基因网络的性能提出了挑战。在这项工作中,我们的目标是将自动化建筑搜索的最新进展纳入GANs,从而从网络结构的角度来看促进示范学习。为此,我们建议为基因对抗性网络建立一个完全不同的搜索框架,称为alphaGAN。搜索进程正式化,解决双级小型数据优化问题,其中外级目标旨在寻找合适的网络结构,实现纯Nash Equilimarrium,以发电机和歧视性网络参数的性能为条件,在内部一级以传统的GAN损失为优化。整个优化将第一阶方法,以完全不同的方式将两级目标缩小,使结构搜索能够在巨大的搜索空间中完成。CIFAR-10和STL-10数据集的广泛实验表明,我们的算法系统可以在G-PRO2搜索过程中获得高性结构的精度结构。我们只能以G-POL 2小时的搜索方式对10结构进行一项可能的搜索。