In this paper, we investigate the problem of bit allocation in Neural Video Compression (NVC). First, we reveal that a recent bit allocation approach claimed to be optimal is, in fact, sub-optimal due to its implementation. Specifically, we find that its sub-optimality lies in the improper application of semi-amortized variational inference (SAVI) on latent with non-factorized variational posterior. Then, we show that the corrected version of SAVI on non-factorized latent requires recursively applying back-propagating through gradient ascent, based on which we derive the corrected optimal bit allocation algorithm. Due to the computational in-feasibility of the corrected bit allocation, we design an efficient approximation to make it practical. Empirical results show that our proposed correction significantly improves the incorrect bit allocation in terms of R-D performance and bitrate error, and outperforms all other bit allocation methods by a large margin. The source code is provided in the supplementary material.
翻译:在本文中,我们调查神经视频压缩(NVC)中的比特分配问题。 首先,我们发现,最近一个所谓最佳的比特分配方法,事实上是次优的。具体地说,我们发现,其亚优度在于对非因子变异后继体的潜伏应用半摊分变率(SAVI)的不当。然后,我们表明,关于非因子变异潜伏的SAVI的校正版本需要通过梯度回溯式再分析,我们据此得出经校正的最佳比特分配算法。由于被校正的比特分配的计算不可行,我们设计了一个高效的近似性使之实用。根据经验,我们提出的修正结果显示,我们提议的修正大大改进了R-D性能和比特率错误方面的不正确的比特分配,并大大超越了所有其他比特分配方法。源代码在补充材料中提供。