Learned image compression based on neural networks have made huge progress thanks to its superiority in learning better representation through non-linear transformation. Different from traditional hybrid coding frameworks, that are commonly block-based, existing learned image codecs usually process the images in a full-resolution manner thus not supporting acceleration via parallelism and explicit prediction. Compared to learned image codecs, traditional hybrid coding frameworks are in general hand-crafted and lack the adaptability of being optimized according to heterogeneous metrics. Therefore, in order to collect their good qualities and offset their weakness, we explore a learned block-based hybrid image compression (LBHIC) framework, which achieves a win-win between coding performance and efficiency. Specifically, we introduce block partition and explicit learned predictive coding into learned image compression framework. Compared to prediction through linear weighting of neighbor pixels in traditional codecs, our contextual prediction module (CPM) is designed to better capture long-range correlations by utilizing the strip pooling to extract the most relevant information in neighboring latent space. Moreover, to alleviate blocking artifacts, we further propose a boundary-aware post-processing module (BPM) with the importance of edge taken into account. Extensive experiments demonstrate that the proposed LBHIC codec outperforms state-of-the-art image compression methods in terms of both PSNR and MS-SSIM metrics and promises obvious time-saving.
翻译:基于神经网络的总结图像压缩由于在通过非线性变换学习更好的代表性方面具有优势而取得了巨大的进步。不同于传统的混合混合编码框架,这种框架通常以整块为基础,现有已学习的图像编码器通常以完全解析的方式处理图像,因此不通过平行和明确预测支持加速。与所学的图像编码器相比,传统的混合编码框架一般是手工制作的,缺乏根据混杂指标优化的适应性。因此,为了收集其优良品质并抵消其弱点,我们探索了一个基于集体的混合图像压缩(LBHIC)框架,该框架在编码性能和效率之间取得了双赢。具体地说,我们引入了块分割和明确学习的预测编码器,将其引入了学习的图像压缩框架。与通过对传统编码中的相邻像素的线性加权预测相比,我们的背景预测模块(CPM)旨在通过利用条式集来提取与周边空间最相关的信息,更好地捕捉到长期的关联性关系。此外,我们进一步建议用一个封闭的制成工艺品制片,从而展示了IMS(B)后级的深度图像模型中的拟议升级模式,同时展示了IMBRO化模式。