We consider a novel lossy compression approach based on unconditional diffusion generative models, which we call DiffC. Unlike modern compression schemes which rely on transform coding and quantization to restrict the transmitted information, DiffC relies on the efficient communication of pixels corrupted by Gaussian noise. We implement a proof of concept and find that it works surprisingly well despite the lack of an encoder transform, outperforming the state-of-the-art generative compression method HiFiC on ImageNet 64x64. DiffC only uses a single model to encode and denoise corrupted pixels at arbitrary bitrates. The approach further provides support for progressive coding, that is, decoding from partial bit streams. We perform a rate-distortion analysis to gain a deeper understanding of its performance, providing analytical results for multivariate Gaussian data as well as theoretic bounds for general distributions. Furthermore, we prove that a flow-based reconstruction achieves a 3 dB gain over ancestral sampling at high bitrates.
翻译:我们认为一种基于无条件传播基因模型的新型损失压缩方法,我们称之为DiffC。 与依赖转换编码和量化来限制传送信息的现代压缩方法不同,DiffC依靠高山噪音腐蚀的像素的有效通信。我们实施概念证明,发现它的运作令人惊讶,尽管没有一种编码器变异,优于图像网64x64上最先进的基因压缩方法HIFIC。DiffC只使用一种单一模型来任意比特率编码和加密腐蚀的像素。这种方法进一步为逐步编码提供了支持,即从部分位流解码。我们进行了率扭曲分析,以便更深入了解其性能,为多种变异的高斯数据提供了分析结果,并为一般分布提供了理论界限。此外,我们证明,基于流动的重建在高位点的祖传采样上取得了3 dB的收益。