The generative adversarial network (GAN) framework has emerged as a powerful tool for various image and video synthesis tasks, allowing the synthesis of visual content in an unconditional or input-conditional manner. It has enabled the generation of high-resolution photorealistic images and videos, a task that was challenging or impossible with prior methods. It has also led to the creation of many new applications in content creation. In this paper, we provide an overview of GANs with a special focus on algorithms and applications for visual synthesis. We cover several important techniques to stabilize GAN training, which has a reputation for being notoriously difficult. We also discuss its applications to image translation, image processing, video synthesis, and neural rendering.
翻译:基因对抗网络(GAN)框架已成为各种图像和视频合成任务的有力工具,能够以无条件或输入条件的方式合成视觉内容,能够生成高分辨率光真化图像和视频,这是以往方法具有挑战性或不可能完成的任务,还导致在内容创建方面创造了许多新的应用程序。在本文中,我们介绍了GAN的概况,特别侧重于视觉合成的算法和应用。我们介绍了稳定GAN培训的若干重要技术,这些技术的声誉是众所周知的困难。我们还讨论了其在图像翻译、图像处理、视频合成和神经转换方面的应用。