As the successor to H.265/HEVC, the new Versatile Video Coding standard (H.266/VVC) can provide up to 50% bitrate savings at the same subjective quality, at the cost of increased decoding complexity. To accelerate adoption of the new standard, this work implements a real-time H.266/VVC software decoder that supports multiple platforms, applying SIMD techniques, parallelism optimization, and acceleration strategies tailored to the characteristics of each coding tool. Because mobile devices have become an essential carrier for video services, these optimizations are implemented not only for the x86 platform but, more importantly, are used to heavily optimize decoding performance on the ARM platform. Experimental results show that on the Apple A14 SoC (iPhone 12 Pro), the implementation achieves an average single-thread decoding speed of 53 fps (RA and LB) for full HD (1080p) bitstreams generated by the VTM-11.0 reference software under the 8-bit Common Test Conditions (CTC). With multi-threading enabled, it achieves an average of 32 fps (RA) when decoding 4K bitstreams.