By optimizing the rate-distortion-realism trade-off, generative compression approaches produce detailed, realistic images, even at low bit rates, instead of the blurry reconstructions produced by rate-distortion optimized models. However, previous methods do not explicitly control how much detail is synthesized, which results in a common criticism of these methods: users might be worried that a misleading reconstruction far from the input image is generated. In this work, we alleviate these concerns by training a decoder that can bridge the two regimes and navigate the distortion-realism trade-off. From a single compressed representation, the receiver can decide to either reconstruct a low mean squared error reconstruction that is close to the input, a realistic reconstruction with high perceptual quality, or anything in between. With our method, we set a new state-of-the-art in distortion-realism, pushing the frontier of achievable distortion-realism pairs, i.e., our method achieves better distortions at high realism and better realism at low distortion than ever before.
翻译:基因压缩方法通过优化率扭曲-现实交易,生成了详细、现实的图像,即使是低比特率的图像,而不是由率扭曲优化模型产生的模糊的重建。然而,先前的方法并未明确控制综合了多少细节,从而导致对这些方法的共同批评:用户可能担心错误重建会产生远离输入图像的错误重建。在这项工作中,我们通过培训一个解码器来缓解这些关切,该解码器可以连接两个制度,并引导扭曲-现实主义交易。从一个单一压缩代表器中,接收器可以决定重建一个接近投入的低平均值平方错误重建,一个现实的、具有高感知性质量的重建,或者任何介于两者之间的重建。用我们的方法,我们设置了一个新的扭曲现实主义最新状态,将可实现的扭曲-现实派对子的前沿推开,也就是说,我们的方法在高现实主义和更好的现实主义中,在低扭曲的低扭曲上比以往更能实现更好的扭曲。