使用已分解的高斯-拉拉-拉拉-Logic Mixture 模型和集成残余模块进行图像压缩 (Learned Image Compression with Discretized Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules)

Recently deep learning-based image compression methods have achieved significant achievements and gradually outperformed traditional approaches including the latest standard Versatile Video Coding (VVC) in both PSNR and MS-SSIM metrics. Two key components of learned image compression frameworks are the entropy model of the latent representations and the encoding/decoding network architectures. Various models have been proposed, such as autoregressive, softmax, logistic mixture, Gaussian mixture, and Laplacian. Existing schemes only use one of these models. However, due to the vast diversity of images, it is not optimal to use one model for all images, even different regions of one image. In this paper, we propose a more flexible discretized Gaussian-Laplacian-Logistic mixture model (GLLMM) for the latent representations, which can adapt to different contents in different images and different regions of one image more accurately. Besides, in the encoding/decoding network design part, we propose a concatenated residual blocks (CRB), where multiple residual blocks are serially connected with additional shortcut connections. The CRB can improve the learning ability of the network, which can further improve the compression performance. Experimental results using the Kodak and Tecnick datasets show that the proposed scheme outperforms all the state-of-the-art learning-based methods and existing compression standards including VVC intra coding (4:4:4 and 4:2:0) in terms of the PSNR and MS-SSIM. The project page is at \url{https://github.com/fengyurenpingsheng/Learned-image-compression-with-GLLMM}

翻译：最近深层次的基于学习的图像压缩方法取得了显著成就,并逐渐超过了传统方法,包括PSNR和MS-SSIM衡量标准中的最新标准 Versatile Video Coding (VVC) 。学习的图像压缩框架的两个关键组成部分是潜表层和编码/ 解码网络结构的变异模型。提出了各种模型, 如自动递增、软式、后勤混合、高斯混合和 Laplaceian 。现有的方案只使用其中的一种模型。但是, 由于图像的广度, 对所有图像, 甚至一个图像的不同区域使用一个模型( VVC) 并不理想。在本文件中, 我们提议一个更灵活的离散化化的 Gossian- Laplaceian- Logistical 混合模型( GLLLLMMMM ) 模型, 用于潜在显示不同图像和不同图像区域的不同内容。此外, 在编码/ 脱变异网络设计中, 我们提议一个配置的残余区块块( CRB) 与更多的捷端点连接, 4-rmalal4 4 图像连接。 Cal- slimal- sal- sal- sal 系统可以改进现有数据库的系统, 学习系统。 Cal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- 系统, 系统, 系统可以改进所有的系统, 和 Sal- sal- sal- sal- sal- salmal- sal- salmal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sald- sal- sal- sal- sal- sal- sal- sal- sald- sal- sal- sal- sal- sal- salmal- sald- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- sal- s