Lossy image compression is often limited by the simplicity of the chosen loss measure. Recent research suggests that generative adversarial networks have the ability to overcome this limitation and serve as a multi-modal loss, especially for textures. Together with learned image compression, these two techniques can be used to great effect when relaxing the commonly employed tight measures of distortion. However, convolutional neural network based algorithms have a large computational footprint. Ideally, an existing conventional codec should stay in place, which would ensure faster adoption and adhering to a balanced computational envelope. As a possible avenue to this goal, in this work, we propose and investigate how learned image coding can be used as a surrogate to optimize an image for encoding. The image is altered by a learned filter to optimise for a different performance measure or a particular task. Extending this idea with a generative adversarial network, we show how entire textures are replaced by ones that are less costly to encode but preserve sense of detail. Our approach can remodel a conventional codec to adjust for the MS-SSIM distortion with over 20% rate improvement without any decoding overhead. On task-aware image compression, we perform favourably against a similar but codec-specific approach.
翻译:最近的研究表明,基因对抗网络有能力克服这一限制,并成为多式损失,特别是质地。与学习到的图像压缩一起,这两种技术在放松通常采用的扭曲的严格测量方法时可以使用大效果。然而,以神经神经网络为基础的演进算法具有巨大的计算足迹。理想的情况是,现有的常规编码器应该留在原地,以确保更快地采用并遵守平衡的计算信封。作为实现这一目标的可能途径,我们在此工作中提议并调查如何利用学习到的图像编码编码作为优化编码图像的替代工具。通过学习到的过滤器来优化不同的性能度量或特定任务。用基因对抗网络来扩展这一想法,我们展示整个纹理如何被那些对编码来说成本较低但能保持详细感的文字所取代。我们的方法可以重塑一种用于调整MS-SSIM扭曲的常规编码器件,在20%以上的频率上进行改进,但不作任何类似的解分解管理器。