We propose a new Generative Adversarial Network for Compressed Video quality Enhancement (CVEGAN). The CVEGAN generator benefits from the use of a novel Mul2Res block (with multiple levels of residual learning branches), an enhanced residual non-local block (ERNB) and an enhanced convolutional block attention module (ECBAM). The ERNB has also been employed in the discriminator to improve the representational capability. The training strategy has also been re-designed specifically for video compression applications, to employ a relativistic sphere GAN (ReSphereGAN) training methodology together with new perceptual loss functions. The proposed network has been fully evaluated in the context of two typical video compression enhancement tools: post-processing (PP) and spatial resolution adaptation (SRA). CVEGAN has been fully integrated into the MPEG HEVC video coding test model (HM16.20) and experimental results demonstrate significant coding gains (up to 28% for PP and 38% for SRA compared to the anchor) over existing state-of-the-art architectures for both coding tools across multiple datasets.
翻译:我们提议建立一个新的压缩视频质量强化创能反反转网络(CVEGAN),CVEGAN生成器得益于使用新的Mul2Res块(包括多层次的留级学习分支)、强化的剩余非本地块(ERNB)和增强的革命块关注模块(ECBAM),ERPB也被用于歧视者,以提高代表能力。培训战略还专门为视频压缩应用程序重新设计了培训战略,以采用相对球GAN(RESPEREGAN)培训方法以及新的感知损失功能。在两种典型的视频压缩强化工具(后处理(PP)和空间分辨率适应(SRA))的背景下,对拟议的网络进行了充分评价。CVEGAN已完全纳入MPEG HEVC视频编码测试模型(HM16.20),实验结果显示,在多个数据集的当前两个编码工具的状态艺术结构上取得了显著的连带收益(PPP至28%,SRAGA值为38%,与锚值相比,SRAAN值为38%)。