Most of the existing deep learning based end-to-end video coding (DLEC) architectures are designed specifically for RGB color format, yet the video coding standards, including H.264/AVC, H.265/HEVC and H.266/VVC developed over past few decades, have been designed primarily for YUV 4:2:0 format, where the chrominance (U and V) components are subsampled to achieve superior compression performances considering the human visual system. While a broad number of papers on DLEC compare these two distinct coding schemes in RGB domain, it is ideal to have a common evaluation framework in YUV 4:2:0 domain for a more fair comparison. This paper introduces a new DLEC architecture for video coding to effectively support YUV 4:2:0 and compares its performance against the HEVC standard under a common evaluation framework. The experimental results on YUV 4:2:0 video sequences show that the proposed architecture can outperform HEVC in intra-frame coding, however inter-frame coding is not as efficient on contrary to the RGB coding results reported in recent papers.
翻译:现有的基于端到端视频编码(DLEC)结构大多是专门为 RGB 颜色格式设计的,而过去几十年开发的视频编码标准,包括H264/AVC、H.265/HEVC和H.266/VC,主要是为YUV 4:2:0格式设计的,在该格式中,色度(U和V)组件进行了子取样,以达到考虑到人类视觉系统的优异压缩性能。虽然关于DLEC的大量文件比较了RGB域中这两个不同的编码方案,但理想的做法是在YUV 4:0域中有一个共同的评价框架,以便进行更公平的比较。本文介绍了一个新的DLEC结构,用于有效支持YUV 4:0的视频编码,并根据共同评价框架将其性能与HEVC标准进行比较。 YUV 4:2:0视频序列的实验结果显示,拟议的结构在内部编码中可以优于HVC,但内部编码并非与最近文件中报告的RGB编码结果相悖。