In rate-distortion optimization, the encoder settings are determined by maximizing a reconstruction quality measure subject to a constraint on the bit rate. One of the main challenges of this approach is to define a quality measure that can be computed with low computational cost and which correlates well with perceptual quality. While several quality measures that fulfil these two criteria have been developed for images and video, no such one exists for 3D point clouds. We address this limitation for the video-based point cloud compression (V-PCC) standard by proposing a linear perceptual quality model whose variables are the V-PCC geometry and color quantization parameters and whose coefficients can easily be computed from two features extracted from the original 3D point cloud. Subjective quality tests with 400 compressed 3D point clouds show that the proposed model correlates well with the mean opinion score, outperforming state-of-the-art full reference objective measures in terms of Spearman rank-order and Pearsons linear correlation coefficient. Moreover, we show that for the same target bit rate, ratedistortion optimization based on the proposed model offers higher perceptual quality than rate-distortion optimization based on exhaustive search with a point-to-point objective quality metric.
翻译:在比例扭曲优化中,编码器设置是通过在比特率的限制下最大限度地使用重建质量衡量标准来确定的。这一方法的主要挑战之一是确定一种质量衡量标准,可以以低计算成本计算,并与感知质量密切相关。虽然为图像和视频开发了若干符合这两个标准的质量衡量标准,但对于3D点云则不存在这种质量衡量标准。我们通过提出一个线性感应质量模型来解决基于视频的点云压缩标准(V-PCC)的这一限制,该模型的变量是V-PCC的几何和色分化参数,而且其系数可以很容易地从最初的3D点云中提取的两个特征中计算出来。400个压缩的3D点云的主观质量测试表明,拟议的模型与平均意见评分密切相关,超过了Spearman级定和Pearson线性相关系数的全参考目标衡量标准。此外,我们还表明,对于同一目标位位位数而言,基于拟议模型的电压调整整形优化提供了高于目标质量的精确度搜索点。