Recently, Learning-based image compression has reached comparable performance with traditional image codecs(such as JPEG, BPG, WebP). However, computational complexity and rate flexibility are still two major challenges for its practical deployment. To tackle these problems, this paper proposes two universal modules named Energy-based Channel Gating(ECG) and Bit-rate Modulator(BM), which can be directly embedded into existing end-to-end image compression models. ECG uses dynamic pruning to reduce FLOPs for more than 50\% in convolution layers, and a BM pair can modulate the latent representation to control the bit-rate in a channel-wise manner. By implementing these two modules, existing learning-based image codecs can obtain ability to output arbitrary bit-rate with a single model and reduced computation.
翻译:最近,基于学习的图像压缩已经达到与传统图像编码(如JPEG、BPG、WebP)相似的性能。然而,计算的复杂性和速度灵活性仍然是实际部署的两大挑战。为了解决这些问题,本文件提出了两个通用模块,即基于能源的频道Gate(ECG)和Bit-people Modulator(BM),这两个模块可以直接嵌入现有的端到端图像压缩模型中。 ECG使用动态剪裁来减少在卷发层50%以上的FLOP,而一个BM配对可以调整潜在代表,以频道方式控制比特速率。 通过实施这两个模块,现有的基于学习的图像编码器可以获得以单一模型和减少计算方式输出任意的位速率的能力。