The interest of the deep learning community in image synthesis has grown massively in recent years. Nowadays, deep generative methods, and especially Generative Adversarial Networks (GANs), are leading to state-of-the-art performance, capable of synthesizing images that appear realistic. While the efforts for improving the quality of the generated images are extensive, most attempts still consider the generator part as an uncorroborated "black-box". In this paper, we aim to provide a better understanding and design of the image generation process. We interpret existing generators as implicitly relying on sparsity-inspired models. More specifically, we show that generators can be viewed as manifestations of the Convolutional Sparse Coding (CSC) and its Multi-Layered version (ML-CSC) synthesis processes. We leverage this observation by explicitly enforcing a sparsifying regularization on appropriately chosen activation layers in the generator, and demonstrate that this leads to improved image synthesis. Furthermore, we show that the same rationale and benefits apply to generators serving inverse problems, demonstrated on the Deep Image Prior (DIP) method.
翻译:近些年来,深层学习界对图像合成的兴趣大幅增长。如今,深层基因方法,特别是基因反转网络(GANs),正在导致最先进的性能,能够综合现实的图像。虽然提高生成图像质量的努力范围很广,但大多数尝试仍然认为生成器是一个未经证实的“黑盒子”部分。在本文中,我们的目标是为图像生成过程提供更好的理解和设计。我们把现有生成器解释为暗地依赖随机振动模型。更具体地说,我们表明,生成器可被视为革命性螺旋形编码(CSC)及其多语言版本合成过程的表现形式。我们利用这一观察,明确强制对生成器中适当选择的启动层进行调节,并证明这可以改善图像合成。此外,我们表明,同样的原理和好处也适用于用于产生反向问题的生成器,在深图像前(DIP)方法上展示。