This paper presents improvements and novel additions to our recent work on end-to-end optimized hierarchical bi-directional video compression to further advance the state-of-the-art in learned video compression. As an improvement, we combine motion estimation and prediction modules and compress refined residual motion vectors for improved rate-distortion performance. As novel addition, we adapted the gain unit proposed for image compression to flexible-rate video compression in two ways: first, the gain unit enables a single encoder model to operate at multiple rate-distortion operating points; second, we exploit the gain unit to control bit allocation among intra-coded vs. bi-directionally coded frames by fine tuning corresponding models for truly flexible-rate learned video coding. Experimental results demonstrate that we obtain state-of-the-art rate-distortion performance exceeding those of all prior art in learned video coding.
翻译:本文介绍了我们最近关于端到端优化等级双向视频压缩工作的改进和新增加。 作为改进,我们把运动估计和预测模块结合起来,并压缩精细的残余运动矢量来改进调制率性能。作为新增加,我们把拟议图像压缩的增益单位改成灵活节率视频压缩用两种方式:第一,增益单位使单一编码器模型能够在多个调速操作点运作;第二,我们利用增益单位来控制内部编码与双向编码框架之间的比分分配,通过细微调整相应的模型来真正灵活化的学习视频编码。实验结果显示,我们获得了超过以往在学习视频编码中所有艺术的先进节率扭曲性能。