Image compression is one of the most fundamental techniques and commonly used applications in the image and video processing field. Earlier methods built a well-designed pipeline, and efforts were made to improve all modules of the pipeline by handcrafted tuning. Later, tremendous contributions were made, especially when data-driven methods revitalized the domain with their excellent modeling capacities and flexibility in incorporating newly designed modules and constraints. Despite great progress, a systematic benchmark and comprehensive analysis of end-to-end learned image compression methods are lacking. In this paper, we first conduct a comprehensive literature survey of learned image compression methods. The literature is organized based on several aspects to jointly optimize the rate-distortion performance with a neural network, i.e., network architecture, entropy model and rate control. We describe milestones in cutting-edge learned image-compression methods, review a broad range of existing works, and provide insights into their historical development routes. With this survey, the main challenges of image compression methods are revealed, along with opportunities to address the related issues with recent advanced learning methods. This analysis provides an opportunity to take a further step towards higher-efficiency image compression. By introducing a coarse-to-fine hyperprior model for entropy estimation and signal reconstruction, we achieve improved rate-distortion performance, especially on high-resolution images. Extensive benchmark experiments demonstrate the superiority of our model in rate-distortion performance and time complexity on multi-core CPUs and GPUs.
翻译:图像压缩是最根本的技术之一,也是在图像和视频处理领域常用的应用手段之一; 早期的方法建造了设计良好的管道,并努力通过手工调整改进管道的所有模块; 后来,作出了巨大的贡献,特别是以数据驱动的方法振兴了域,其示范能力和灵活性极强,纳入了新设计的模块和制约因素; 尽管取得了巨大进展,但是缺乏对端对端学习的图像压缩方法的系统基准和全面分析; 在本文件中,我们首先对已学习的图像压缩方法进行全面的文献调查; 文献是根据几个方面组织起来的,目的是通过一个神经网络(即网络结构、通缩模型和节率控制),共同优化速度扭曲性能表现。 我们描述了先进图像压缩方法的先进阶段性能,审查现有的大量工作,并深入了解其历史发展路线。 通过这项调查,我们揭示了图像压缩方法的主要挑战,同时有机会解决与最近先进的高级学习方法有关的问题。 这一分析提供了一个进一步向更高效率的图像压缩模型迈出一步,即网络结构结构、网络结构结构、增缩模型和费率控制。 我们介绍了先进的图像升级模型的升级率的升级,特别高分辨率,从而在改进了我们的模型和高分辨率图像上实现升级的升级。