Measuring the perceptual quality of images automatically is an essential task in the area of computer vision, as degradations on image quality can exist in many processes from image acquisition, transmission to enhancing. Many Image Quality Assessment(IQA) algorithms have been designed to tackle this problem. However, it still remains un settled due to the various types of image distortions and the lack of large-scale human-rated datasets. In this paper, we propose a novel algorithm based on the Swin Transformer [31] with fused features from multiple stages, which aggregates information from both local and global features to better predict the quality. To address the issues of small-scale datasets, relative rankings of images have been taken into account together with regression loss to simultaneously optimize the model. Furthermore, effective data augmentation strategies are also used to improve the performance. In comparisons with previous works, experiments are carried out on two standard IQA datasets and a challenge dataset. The results demonstrate the effectiveness of our work. The proposed method outperforms other methods on standard datasets and ranks 2nd in the no-reference track of NTIRE 2022 Perceptual Image Quality Assessment Challenge [53]. It verifies that our method is promising in solving diverse IQA problems and thus can be used to real-word applications.
翻译:自动测量图像的感知质量是计算机视觉领域的一项重要任务,因为图像质量的退化可能存在于从图像获取、传输到增强等许多过程中。许多图像质量评估(IQA)算法的设计是为了解决这一问题。然而,由于图像扭曲的种类多种多样,以及缺乏大规模人类评级数据集,它仍未得到解决。在本文件中,我们提议了基于Swin变形器[31]的新算法,该算法具有多个阶段的结合功能,将来自本地和全球特征的信息汇总起来,以更好地预测质量。为了解决小规模数据集的问题,图像的相对排序与回归损失一起被考虑在内,同时优化模型。此外,还采用有效的数据增强战略来改进性能。在与以往工作相比,在两个标准的IQA数据集和质疑数据集上进行了实验。结果显示我们的工作的有效性。拟议的方法在标准数据集方面优于其他方法,在标准数据集和不参照轨道中排行第二位,从而更好地预测质量。为了处理小规模数据集问题,将图像的相对排序与回归损失一起考虑,同时优化模型。此外,还使用了有效的数据增强战略来改进工作。在ISIREDI-I-I-Provial SIal Adal Adal Exvial 这样的版本中,我们使用了真实质量评估方法。