In this paper we propose a generative adversarial network (GAN) framework to enhance the perceptual quality of compressed videos. Our framework includes attention and adaptation to different quantization parameters (QPs) in a single model. The attention module exploits global receptive fields that can capture and align long-range correlations between consecutive frames, which can be beneficial for enhancing perceptual quality of videos. The frame to be enhanced is fed into the deep network together with its neighboring frames, and in the first stage features at different depths are extracted. Then extracted features are fed into attention blocks to explore global temporal correlations, followed by a series of upsampling and convolution layers. Finally, the resulting features are processed by the QP-conditional adaptation module which leverages the corresponding QP information. In this way, a single model can be used to enhance adaptively to various QPs without requiring multiple models specific for every QP value, while having similar performance. Experimental results demonstrate the superior performance of the proposed PeQuENet compared with the state-of-the-art compressed video quality enhancement algorithms.
翻译:在本文中,我们提出一个基因对抗网络框架,以提高压缩视频的感官质量。我们的框架包括关注和调整单一模型中不同的量化参数。关注模块利用全球可接受字段,可以捕捉和调整连续框架之间的长距离关联,这有利于提高视频的感化质量。要增强的框架与其相邻框架一起被注入深网络,并在第一阶段在不同深度上提取。然后将提取的功能输入关注区块,以探索全球时间相关性,然后进行一系列的升级和演进层。最后,由此产生的功能由利用相应QP信息的QP条件适应模块处理。这样,可以使用一个单一模型,在不要求针对每个QP价值的多个具体模型的情况下,同时具有类似的性能。实验结果表明,与最先进的压缩视频质量算法相比,拟议的PeQuENet的优异性表现。