In-loop filtering is used in video coding to process the reconstructed frame in order to remove blocking artifacts. With the development of convolutional neural networks (CNNs), CNNs have been explored for in-loop filtering considering it can be treated as an image de-noising task. However, in addition to being a distorted image, the reconstructed frame is also obtained by a fixed line of block based encoding operations in video coding. It carries coding-unit based coding distortion of some similar characteristics. Therefore, in this paper, we address the filtering problem from two aspects, global appearance restoration for disrupted texture and local coding distortion restoration caused by fixed pipeline of coding. Accordingly, a three-stream global appearance and local coding distortion based fusion network is developed with a high-level global feature stream, a high-level local feature stream and a low-level local feature stream. Ablation study is conducted to validate the necessity of different features, demonstrating that the global features and local features can complement each other in filtering and achieve better performance when combined. To the best of our knowledge, we are the first one that clearly characterizes the video filtering process from the above global appearance and local coding distortion restoration aspects with experimental verification, providing a clear pathway to developing filter techniques. Experimental results demonstrate that the proposed method significantly outperforms the existing single-frame based methods and achieves 13.5%, 11.3%, 11.7% BD-Rate saving on average for AI, LDP and RA configurations, respectively, compared with the HEVC reference software.
翻译:在视频编码中,使用环形过滤器处理重建框架的编码,以消除阻塞文物。随着进化神经神经网络(CNNs)的发展,已经探索了CNN的内圈过滤器,认为它可以被视为一个图像去除任务。然而,除了被扭曲的图像外,在视频编码中,通过基于块编码操作的固定线路来获取重建框架。它含有一些类似特性的基于编码的扭曲。因此,在本文中,我们从两个方面处理过滤问题,即由于固定的编码管道而中断的纹理和本地编码扭曲的恢复全球外观。因此,三流全球外观和基于本地编码的扭曲网络可以被视为一个图像去除任务的任务。除了被歪曲的图像外,还用视频编码编码中基于块的固定编码操作的固定线路来获取。7 因此,在过滤和取得更好的业绩时,我们从两个方面都从两个方面着手过滤问题,即:为固定的调试管道所干扰的文本和本地的变异性变异性结构。我们第一次将分别用来将基于高层次的图像修正方法描述出一个显著的图像升级方法,从上面的B级修正过程。