Generative Adversarial Networks (GAN) boast impressive capacity to generate realistic images. However, like much of the field of deep learning, they require an inordinate amount of data to produce results, thereby limiting their usefulness in generating novelty. In the same vein, recent advances in meta-learning have opened the door to many few-shot learning applications. In the present work, we propose Few-shot Image Generation using Reptile (FIGR), a GAN meta-trained with Reptile. Our model successfully generates novel images on both MNIST and Omniglot with as little as 4 images from an unseen class. We further contribute FIGR-8, a new dataset for few-shot image generation, which contains 1,548,944 icons categorized in over 18,409 classes. Trained on FIGR-8, initial results show that our model can generalize to more advanced concepts (such as "bird" and "knife") from as few as 8 samples from a previously unseen class of images and as little as 10 training steps through those 8 images. This work demonstrates the potential of training a GAN for few-shot image generation and aims to set a new benchmark for future work in the domain.
翻译:创世Adversarial Networks(GAN)拥有制作现实图像的惊人能力,然而,与许多深层学习领域一样,它们需要大量的数据才能产生结果,从而限制其产生新颖的效用。同样,元学习的最新进展为许多少见的学习应用程序打开了大门。在目前的工作中,我们提议使用微小的图像生成,即GAN经Reptile(FIGR)的元训练后再生。我们的模型成功地在MNIST和Omniglot上生成了新图像,从一个看不见的类别中仅产生4个图像。我们进一步贡献FIGR-8,这是几发图像生成的新数据集,包含1,548,944个在18,409个课程中分类的图标。在FIGR-8上培训的初步结果显示,我们的模型可以归纳出更先进的概念(如“鸟”和“Knifefe”),从一个以前看不见的图像类别中少到8个样本,从这8个新图像的10个培训步骤。我们的工作展示了GAN在几幅未来图像领域培训的可能性。