多现实感图像压缩与条件生成器 (Multi-Realism Image Compression with a Conditional Generator)

By optimizing the rate-distortion-realism trade-off, generative compression approaches produce detailed, realistic images, even at low bit rates, instead of the blurry reconstructions produced by rate-distortion optimized models. However, previous methods do not explicitly control how much detail is synthesized, which results in a common criticism of these methods: users might be worried that a misleading reconstruction far from the input image is generated. In this work, we alleviate these concerns by training a decoder that can bridge the two regimes and navigate the distortion-realism trade-off. From a single compressed representation, the receiver can decide to either reconstruct a low mean squared error reconstruction that is close to the input, a realistic reconstruction with high perceptual quality, or anything in between. With our method, we set a new state-of-the-art in distortion-realism, pushing the frontier of achievable distortion-realism pairs, i.e., our method achieves better distortions at high realism and better realism at low distortion than ever before.

翻译：通过优化速率-失真-现实感之间的权衡，生成式压缩方法可以在低比特率下生成详细、逼真的图像，而不是失真的重构。然而，以往的方法并没有明确地控制合成的细节数量，这导致了这些方法的一个普遍批评：用户可能担心生成了远离输入图像的误导性重构。在这项研究中，我们通过训练一个解码器来缓解这些顾虑，这个解码器可以在两个领域之间进行转换，同时进行失真-现实感的权衡。从单一的压缩表示中，接收者可以决定重构一个低均方误差的与输入接近的重构，或者高感知质量的逼真重构，或者两者之间的任何东西。通过我们的方法，我们在失真-现实感方面取得了新的最优结果，推动了能够实现失真-现实感对的垂直。具体来说，我们的方法在更高的逼真度上取得更好的失真，而在更低的失真水平上取得更好的真实感，超过了以往的研究成果。

相关内容

生成器

关注 2

生成器是一次生成一个值的特殊类型函数。可以将其视为可恢复函数。调用该函数将返回一个可用于生成连续 x 值的生成【Generator】，简单的说就是在函数的执行过程中，yield语句会把你需要的值返回给调用生成器的地方，然后退出函数，下一次调用生成器函数的时候又从上次中断的地方开始执行，而生成器内的所有变量参数都会被保存下来供下一次使用。

【CVPR2020】通过自适应GANs生成不同的图像，Diverse Image Generation via Self-Conditioned GANs

专知会员服务

34+阅读 · 2020年6月19日

【ACL2020】对抗性文本生成，Improving Adversarial Text Generation

专知会员服务

52+阅读 · 2020年5月5日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日