The user experience in adaptive HTTP streaming relies on offering bitrate ladders with suitable operation points for all users and typically involves multiple resolutions. While open GOP coding structures are generally known to provide substantial coding efficiency benefit, their use in HTTP streaming has been precluded through lacking support of reference picture resampling (RPR) in AVC and HEVC. The newly emerging Versatile Video Coding (VVC) standard supports RPR, but only conversational scenarios were primarily investigated during the design of VVC. This paper aims at enabling usage of RPR in HTTP streaming scenarios through analysing the drift potential of VVC coding tools and presenting a constrained encoding method that avoids severe drift artefacts in resolution switching with open GOP coding in VVC. In typical live streaming configurations, the presented method achieves -8.57% BD-rate reduction compared to closed GOP coding while in a typical Video on Demand configuration, -1.89% BD-rate reduction is reported. The constraints penalty compared to regular open GOP coding is 0.65% BD-rate in the worst case. The presented method was integrated into the publicly available open source VVC encoder VVenC v0.3.
翻译:适应性 HTTP 流流的用户经验取决于提供比特拉梯,为所有用户提供合适的操作点,通常涉及多个分辨率。虽然众所周知,开放的 GOP 编码结构可以提供大量编码效率效益,但由于缺乏对AVC和HIVC的参考图片复制(RPR)的支持,这些结构无法用于HTTP流流。 新兴的Versatile Video Coding (VC) 标准支持RPR,但在VVC设计期间,只对对话情景进行了调查。 本文的目的是通过分析VVVC 编码工具的漂移潜力和提出有限制的编码方法,避免在用VVVC 的开放式 GOP 编码转换分辨率时发生严重流动。 在典型的活流配置中,所介绍的方法比封闭的GOP 编码减少了8.57% BD 率,而在典型的关于需求配置的视频典型视频中,报告BD-8.9% BD 率递减。在最坏的案例中,与常规的 GOP 公开 GOP 编码为0.65% BD-rat 的制约处罚是VC 。