Variational Autoencoders (VAEs) have seen widespread use in learned image compression. They are used to learn expressive latent representations on which downstream compression methods can operate with high efficiency. Recently proposed 'bits-back' methods can indirectly encode the latent representation of images with codelength close to the relative entropy between the latent posterior and the prior. However, due to the underlying algorithm, these methods can only be used for lossless compression, and they only achieve their nominal efficiency when compressing multiple images simultaneously; they are inefficient for compressing single images. As an alternative, we propose a novel method, Relative Entropy Coding (REC), that can directly encode the latent representation with codelength close to the relative entropy for single images, supported by our empirical results obtained on the Cifar10, ImageNet32 and Kodak datasets. Moreover, unlike previous bits-back methods, REC is immediately applicable to lossy compression, where it is competitive with the state-of-the-art on the Kodak dataset.
翻译:已知的图像压缩中广泛使用变化式自动编码器(VAEs),它们被用于学习下游压缩方法能够高效运行的表达式潜在表达方式。最近提出的“回位法”方法可以间接地将图像的潜在表达方式编码成代码长度接近潜伏后后方和前方之间相对酶的编码。然而,由于基本的算法,这些方法只能用于无损压缩,只有在同时压缩多个图像时,它们才能达到其名义效率;它们对于压缩单个图像来说是低效的。作为一种替代办法,我们提出了一个新颖的方法,即相对 Entropy Coding(REC),它可以直接将潜在表达方式编码成代码,其代号与单个图像的相对酶相近,并得到我们在Cifar10、图像Net32和Kodak数据集上取得的经验结果的支持。此外,与以往的位后方方法不同,REC直接适用于损失压缩,因为它与Kodak数据集的状态具有竞争力。