We describe a novel lossy compression approach called DiffC which is based on unconditional diffusion generative models. Unlike modern compression schemes which rely on transform coding and quantization to restrict the transmitted information, DiffC relies on the efficient communication of pixels corrupted by Gaussian noise. We implement a proof of concept and find that it works surprisingly well despite the lack of an encoder transform, outperforming the state-of-the-art generative compression method HiFiC on ImageNet 64x64. DiffC only uses a single model to encode and denoise corrupted pixels at arbitrary bitrates. The approach further provides support for progressive coding, that is, decoding from partial bit streams. We perform a rate-distortion analysis to gain a deeper understanding of its performance, providing analytical results for multivariate Gaussian data as well as initial results for general distributions. Furthermore, we show that a flow-based reconstruction achieves a 3 dB gain over ancestral sampling at high bitrates.
翻译:我们描述了一种新型的失传压缩方法,即基于无条件扩散基因模型的DiffC。与依赖转换编码和量化来限制传送信息的现代压缩方法不同,DiffC依赖高山噪音腐蚀的像素的有效通信。我们运用了概念证明,发现尽管没有编码器变形,它的运作效果令人惊讶,优于图像网64x64上最先进的基因压缩方法HIFIC。DiffC只使用单一模型来任意比特率编码和沉思腐蚀像素。这种方法进一步为逐步编码提供支持,即从部分位流解码。我们进行了率扭曲分析,以加深对其性能的理解,为多变制高斯数据提供了分析结果,并为一般分布提供了初步结果。此外,我们显示,基于流动的重建在高比特率的祖传采样上取得了3 dB的收益。