The Super-Resolution Generative Adversarial Network (SRGAN) is a seminal work that is capable of generating realistic textures during single image super-resolution. However, the hallucinated details are often accompanied with unpleasant artifacts. To further enhance the visual quality, we thoroughly study three key components of SRGAN - network architecture, adversarial loss and perceptual loss, and improve each of them to derive an Enhanced SRGAN (ESRGAN). In particular, we introduce the Residual-in-Residual Dense Block (RRDB) without batch normalization as the basic network building unit. Moreover, we borrow the idea from relativistic GAN to let the discriminator predict relative realness instead of the absolute value. Finally, we improve the perceptual loss by using the features before activation, which could provide stronger supervision for brightness consistency and texture recovery. Benefiting from these improvements, the proposed ESRGAN achieves consistently better visual quality with more realistic and natural textures than SRGAN and won the first place in the PIRM2018-SR Challenge. The code is available at https://github.com/xinntao/ESRGAN .
翻译:超分辨率生成反感网络(SRGAN)是一项具有开创性的工作,能够在单一图像超分辨率的单个图像超分辨率中产生现实的质地。然而,幻影细节往往伴有令人不愉快的文物。为了进一步提高视觉质量,我们彻底研究SRGAN的三个关键组成部分:网络结构、对抗性损失和感知损失,并改进其中的每一个组成部分,以获得一个增强的SRGAN(ESRGAN)。特别是,我们引入了未作为基本网络建设单元进行批量正常化的残余-在恢复性共振区(RRRDB ) 。此外,我们从relativical GAN 借用了这个想法,让歧视者预测相对真实性而不是绝对价值。最后,我们通过在激活前使用功能改进感知性损失,这可以为光度的一致性和质的恢复提供更有力的监督。从这些改进中受益,拟议的ESRGAN 以比SRGAN更现实和自然质的质素质持续提高视觉质量,并赢得PIRM2018-SR挑战的第一个位置。该代码可在 https://SR.ES./x.org./xnn.com查阅。