Deep variational autoencoders for image and video compression have gained significant attraction in the recent years, due to their potential to offer competitive or better compression rates compared to the decades long traditional codecs such as AVC, HEVC or VVC. However, because of complexity and energy consumption, these approaches are still far away from practical usage in industry. More recently, implicit neural representation (INR) based codecs have emerged, and have lower complexity and energy usage to classical approaches at decoding. However, their performances are not in par at the moment with state-of-the-art methods. In this research, we first show that INR based image codec has a lower complexity than VAE based approaches, then we propose several improvements for INR-based image codec and outperformed baseline model by a large margin.
翻译:近些年来,图像和视频压缩的深变自动编码器获得了显著的吸引力,因为与几十年的传统编码器相比,它们有可能提供竞争性或更好的压缩率,如AVC、HEVC或VVC。 然而,由于这些方法的复杂性和能源消耗,这些方法仍远未在工业中实际使用,最近出现了基于内隐神经代言(INR)的编码器,对传统解码方法的复杂程度和能量使用较低。然而,它们的性能与最新方法相比并不相同。在这项研究中,我们首先显示IRR基于图像编码器的图像编码器比VAE法的复杂程度要低,然后我们提出对IRR的图像代码器进行若干改进,并以大幅度超过基准模型。</s>