Recently, many neural network-based image compression methods have shown promising results superior to the existing tool-based conventional codecs. However, most of them are often trained as separate models for different target bit rates, thus increasing the model complexity. Therefore, several studies have been conducted for learned compression that supports variable rates with single models, but they require additional network modules, layers, or inputs that often lead to complexity overhead, or do not provide sufficient coding efficiency. In this paper, we firstly propose a selective compression method that partially encodes the latent representations in a fully generalized manner for deep learning-based variable-rate image compression. The proposed method adaptively determines essential representation elements for compression of different target quality levels. For this, we first generate a 3D importance map as the nature of input content to represent the underlying importance of the representation elements. The 3D importance map is then adjusted for different target quality levels using importance adjustment curves. The adjusted 3D importance map is finally converted into a 3D binary mask to determine the essential representation elements for compression. The proposed method can be easily integrated with the existing compression models with a negligible amount of overhead increase. Our method can also enable continuously variable-rate compression via simple interpolation of the importance adjustment curves among different quality levels. The extensive experimental results show that the proposed method can achieve comparable compression efficiency as those of the separately trained reference compression models and can reduce decoding time owing to the selective compression. The sample codes are publicly available at https://github.com/JooyoungLeeETRI/SCR.
翻译:最近,许多基于神经网络的图像压缩方法显示出了优于现有基于工具的常规调制解码器的有希望的结果。然而,其中多数方法往往被培训为不同目标比位率的单独模型,从而增加模型的复杂性。因此,进行了一些研究,以学习压缩方法支持单一模型的可变率,但需要额外的网络模块、层或投入,往往导致管理管理复杂,或不能提供足够的编码效率。在本文件中,我们首先提议一种选择性压缩方法,将潜在代表部分以完全普遍的方式编码为深层次学习基于可变率的图像压缩。拟议方法在适应性上决定压缩不同目标质量水平的基本代表要素。为此,我们首先制作了一张3D重要性地图,作为输入内容的性质,以代表代表性要素的根本重要性。3D重要性地图随后需要根据不同的目标质量水平调整,同时使用重要调整曲线。经过调整的3D重要性地图最终被转换为3D的二进制掩码,用以确定压缩的基本代表要素。拟议方法可以很容易与现有的压缩模型整合成一个可忽略的可忽略的间接参考值。为此,我们的方法还可以通过测试的精度提高的精度。我们的方法还可以通过可变缩压方法,通过可变的精确化的方法可以实现可变缩化的方法,通过可变缩制方法,通过可变缩制的方法可以实现。