Video and image quality assessment has long been projected as a regression problem, which requires predicting a continuous quality score given an input stimulus. However, recent efforts have shown that accurate quality score regression on real-world user-generated content (UGC) is a very challenging task. To make the problem more tractable, we propose two new methods - binary, and ordinal classification - as alternatives to evaluate and compare no-reference quality models at coarser levels. Moreover, the proposed new tasks convey more practical meaning on perceptually optimized UGC transcoding, or for preprocessing on media processing platforms. We conduct a comprehensive benchmark experiment of popular no-reference quality models on recent in-the-wild picture and video quality datasets, providing reliable baselines for both evaluation methods to support further studies. We hope this work promotes coarse-grained perceptual modeling and its applications to efficient UGC processing.
翻译:长期以来,视频和图像质量评估一直被预测为一个回归问题,这要求根据投入刺激因素预测持续的质量评分。然而,最近的努力表明,在现实世界用户生成的内容(UGC)上准确的质量评分回归是一项非常艰巨的任务。为了使这一问题更加易于处理,我们提出了两种新方法,即二进制和交替分类,作为在粗金刚石一级评价和比较无参考质量模型的替代方法。此外,拟议的新任务对概念优化的UGC转换或媒体处理平台的预处理具有更实际的意义。我们对近期的网上图片和视频质量数据集进行了流行的不参考质量模型的全面基准实验,为两种评估方法提供了可靠的基线以支持进一步研究。我们希望这项工作能够促进粗化的观念模型及其应用于高效的UGC处理。