Image compression has raised widespread interest recently due to its significant importance for multimedia storage and transmission. Meanwhile, a reliable image quality assessment (IQA) for compressed images can not only help to verify the performance of various compression algorithms but also help to guide the compression optimization in turn. In this paper, we design a full-reference image quality assessment metric SwinIQA to measure the perceptual quality of compressed images in a learned Swin distance space. It is known that the compression artifacts are usually non-uniformly distributed with diverse distortion types and degrees. To warp the compressed images into the shared representation space while maintaining the complex distortion information, we extract the hierarchical feature representations from each stage of the Swin Transformer. Besides, we utilize cross attention operation to map the extracted feature representations into a learned Swin distance space. Experimental results show that the proposed metric achieves higher consistency with human's perceptual judgment compared with both traditional methods and learning-based methods on CLIC datasets.
翻译:图像压缩最近引起了广泛的关注, 因为它对多媒体存储和传输非常重要。 与此同时, 压缩图像的可靠图像质量评估( IQA) 不仅有助于验证各种压缩算法的性能, 也有助于反过来引导压缩优化。 在本文中, 我们设计了一个全参考图像质量评估 指标 SwinIQA, 以测量学习的 Swin距离空间中压缩图像的感知质量 。 众所周知, 压缩制品通常不是以不同扭曲类型和程度以非统一方式分布的。 要将压缩图像转换到共享的显示空间, 同时保持复杂的扭曲信息, 我们从 Swin Transforder 的每个阶段提取等级特征表。 此外, 我们利用交叉关注操作将提取的特征显示绘制成一个学习的 Swin 距离空间。 实验结果表明, 与传统的方法和基于学习的CCIC 数据集方法相比, 拟议的指标提高了人类感知判断的一致性 。