In recent years, there has been widespread attention drawn to convolutional neural network (CNN) based blind image quality assessment (IQA). A large number of works start by extracting deep features from CNN. Then, those features are processed through spatial average pooling (SAP) and fully connected layers to predict quality. Inspired by full reference IQA and texture features, in this paper, we extend SAP ($1^{st}$ moment) into spatial moment pooling (SMP) by incorporating higher order moments (such as variance, skewness). Moreover, we provide learning friendly normalization to circumvent numerical issue when computing gradients of higher moments. Experimental results suggest that simply upgrading SAP to SMP significantly enhances CNN-based blind IQA methods and achieves state of the art performance.
翻译:近年来,人们广泛关注以变幻神经网络为基础的盲人图像质量评估(IQA),大量工作从CNN的深层特征开始,然后通过空间平均集合(SAP)和完全相连的层进行处理,以预测质量。受完整参考IQA和纹理特征的启发,本文将SAP($$$ moment)扩大到空间时速集合(SMP),将更高的顺序时刻(如差异、偏差)纳入其中。此外,我们提供学习的友好正常化,以避免在计算更高时刻的梯度时出现数字问题。实验结果表明,只要将SAP升级为SMP,就可以大大增强基于CNN的盲人IQA方法并实现艺术性能状态。