When comparing learned image/video restoration and compression methods, it is common to report peak-signal to noise ratio (PSNR) results. However, there does not exist a generally agreed upon practice to compute PSNR for sets of images or video. Some authors report average of individual image/frame PSNR, which is equivalent to computing a single PSNR from the geometric mean of individual image/frame mean-square error (MSE). Others compute a single PSNR from the arithmetic mean of frame MSEs for each video. Furthermore, some compute the MSE/PSNR of Y-channel only, while others compute MSE/PSNR for RGB channels. This paper investigates different approaches to computing PSNR for sets of images, single video, and sets of video and the relation between them. We show the difference between computing the PSNR based on arithmetic vs. geometric mean of MSE depends on the distribution of MSE over the set of images or video, and that this distribution is task-dependent. In particular, these two methods yield larger differences in restoration problems, where the MSE is exponentially distributed and smaller differences in compression problems, where the MSE distribution is narrower. We hope this paper will motivate the community to clearly describe how they compute reported PSNR values to enable consistent comparison.
翻译:在比较所学的图像/图像恢复和压缩方法时,通常会报告峰值与噪音比率(PSNR)的峰值与噪声比率(PSNR)的结果。然而,目前没有普遍商定的计算图像或视频组合的PSNR的计算做法。有些作者报告个人图像/框架PSNR的平均值,这相当于从单个图像/框架平均方差的几何平均值计算单一的PSNR。另一些作者则根据每部视频的MSSE的算术平均值计算单一的PSNR。此外,有些则只计算Y频道的MSE/PRSNR,而另一些则计算RGB频道的MSE/PNR。本文调查了为图像、单一视频和视频组合及其关系计算PSNR的不同方法。我们显示了根据计算个人图像/框架平均差对单个图像/框架平均差(MSR)的差别取决于MSE对成套图像或视频框架的分布情况,而这种分布取决于任务。特别是这两种方法在恢复问题中产生更大的差异,在这些方面,MSE的分布将明确反映的深度差异。