Subjective image-quality measurement plays a critical role in the development of image-processing applications. The purpose of a visual-quality metric is to approximate the results of subjective assessment. In this regard, more and more metrics are under development, but little research has considered their limitations. This paper addresses that deficiency: we show how image preprocessing before compression can artificially increase the quality scores provided by the popular metrics DISTS, LPIPS, HaarPSI, and VIF as well as how these scores are inconsistent with subjective-quality scores. We propose a series of neural-network preprocessing models that increase DISTS by up to 34.5%, LPIPS by up to 36.8%, VIF by up to 98.0%, and HaarPSI by up to 22.6% in the case of JPEG-compressed images. A subjective comparison of preprocessed images showed that for most of the metrics we examined, visual quality drops or stays unchanged, limiting the applicability of these metrics.
翻译:主观图像质量测量在发展图像处理应用程序方面发挥着关键作用。 视觉质量度量的目的是近似主观评估的结果。 在这方面,正在开发越来越多的指标,但研究很少考虑其局限性。 本文述及这一缺陷:我们展示了压缩前图像预处理如何能人为地增加广受欢迎的DISTS、 LPIPS、HaarPSI和VIF提供的质量分数,以及这些分数与主观质量分数不相符。 我们提出了一系列神经网络预处理模型,将DISTS增加34.5%, LPIPS增加36.8%,VIF增加98.0%,HaarPSI增加22.6%,JPEG压缩图像的主观比较表明,对于我们所审查的大多数指标而言,视觉质量投放或保持不变,限制了这些指标的适用性。