Automatic Perceptual Image Quality Assessment is a challenging problem that impacts billions of internet, and social media users daily. To advance research in this field, we propose a Mixture of Experts approach to train two separate encoders to learn high-level content and low-level image quality features in an unsupervised setting. The unique novelty of our approach is its ability to generate low-level representations of image quality that are complementary to high-level features representing image content. We refer to the framework used to train the two encoders as Re-IQA. For Image Quality Assessment in the Wild, we deploy the complementary low and high-level image representations obtained from the Re-IQA framework to train a linear regression model, which is used to map the image representations to the ground truth quality scores, refer Figure 1. Our method achieves state-of-the-art performance on multiple large-scale image quality assessment databases containing both real and synthetic distortions, demonstrating how deep neural networks can be trained in an unsupervised setting to produce perceptually relevant representations. We conclude from our experiments that the low and high-level features obtained are indeed complementary and positively impact the performance of the linear regressor. A public release of all the codes associated with this work will be made available on GitHub.
翻译:自动视觉图像质量评估是一个具有挑战性的问题,每天影响着数十亿互联网和社交媒体用户。为了促进这一领域的研究,我们提出了一种混合专家的方法,利用无监督的方式训练两个独立的编码器,学习高层次的内容特征和低层次的图像质量特征。我们方法的独特之处在于,它可以产生与表示图像内容的高层次特征互补的图像质量的低层次表示形式。我们将用于训练两个编码器的框架称为Re-IQA。对于自然场景下的图像质量评估,我们使用从Re-IQA框架获得的互补低和高层次图像表示来训练线性回归模型,该模型将图像表示映射到实际质量分数,具体请参见图1。我们的方法在多个大规模图像质量评估数据库(包括真实扭曲和合成扭曲)上实现了最先进的性能,展示了如何在无监督的情况下训练深度神经网络来产生感知相关特征。我们的实验结果表明,获得的低层次和高层次特征确实是互补的,并且对线性回归器的性能有积极的影响。我们将在GitHub上公开发布与本研究相关的全部代码。