We present a novel deep neural network (DNN) architecture for compressing an image when a correlated image is available as side information only at the decoder. This problem is known as distributed source coding (DSC) in information theory. In particular, we consider a pair of stereo images, which generally have high correlation with each other due to overlapping fields of view, and assume that one image of the pair is to be compressed and transmitted, while the other image is available only at the decoder. In the proposed architecture, the encoder maps the input image to a latent space, quantizes the latent representation, and compresses it using entropy coding. The decoder is trained to extract the common information between the input image and the correlated image, using only the latter. The received latent representation and the locally generated common information are passed through a decoder network to obtain an enhanced reconstruction of the input image. The common information provides a succinct representation of the relevant information at the receiver. We train and demonstrate the effectiveness of the proposed approach on the KITTI and Cityscape datasets of stereo image pairs. Our results show that the proposed architecture is capable of exploiting the decoder-only side information, and outperforms previous work on stereo image compression with decoder side information.
翻译:我们展示了一个新颖的深心神经网络(DNN)结构,用于压缩图像,当一个相关图像仅作为侧面信息存在于解码器中时,压缩图像。这个问题在信息理论中被称为分布源编码(DSC) 。特别是,我们考虑的是一副立体图像,由于观测领域重叠,这些图像通常具有高度相关性,并假设对立体图像的一个图像要压缩和传输,而另一图像只能在解码器中提供。在提议的架构中,编码器将输入图像映射到一个潜在空间,对潜在代表方进行量化,并使用加密编码来压缩图像。解码器受过培训,以提取输入图像和关联图像之间的共同信息,仅使用后者。接收到的潜在代表方图像和本地生成的共同信息通过解码器网络传递,以加强对输入图像的重建。共同信息在接收器中提供简明的相关信息描述。我们培训和演示了关于 KITTI 和立体图像侧侧方数据设置的拟议方法的有效性。我们的成果显示的是,我们用先前的立体图像格式显示了该结构。