Diffusion probabilistic models have recently achieved remarkable success in generating high quality image and video data. In this work, we build on this class of generative models and introduce a method for lossy compression of high resolution images. The resulting codec, which we call DIffuson-based Residual Augmentation Codec (DIRAC),is the first neural codec to allow smooth traversal of the rate-distortion-perception tradeoff at test time, while obtaining competitive performance with GAN-based methods in perceptual quality. Furthermore, while sampling from diffusion probabilistic models is notoriously expensive, we show that in the compression setting the number of steps can be drastically reduced.
翻译:传播概率模型最近在生成高质量图像和视频数据方面取得了显著的成功。 在这项工作中,我们利用了这一类基因模型并引入了高分辨率图像丢失压缩的方法。 由此产生的代码(我们称之为Diffuson的Diffuson剩余增强代码(DIRAC))是第一个允许测试时顺利地跨过率扭曲-感知取舍的神经代码,同时在感知质量方面获得以GAN为基础的方法的竞争性性能。 此外,虽然从扩散概率模型中取样的成本非常高,但我们在压缩设定步骤数量时可以大幅降低。