Light field imaging can capture both the intensity information and the direction information of light rays. It naturally enables a six-degrees-of-freedom viewing experience and deep user engagement in virtual reality. Compared to 2D image assessment, light field image quality assessment (LFIQA) needs to consider not only the image quality in the spatial domain but also the quality consistency in the angular domain. However, there is a lack of metrics to effectively reflect the angular consistency and thus the angular quality of a light field image (LFI). Furthermore, the existing LFIQA metrics suffer from high computational costs due to the excessive data volume of LFIs. In this paper, we propose a novel concept of "anglewise attention" by introducing a multihead self-attention mechanism to the angular domain of an LFI. This mechanism better reflects the LFI quality. In particular, we propose three new attention kernels, including anglewise self-attention, anglewise grid attention, and anglewise central attention. These attention kernels can realize angular self-attention, extract multiangled features globally or selectively, and reduce the computational cost of feature extraction. By effectively incorporating the proposed kernels, we further propose our light field attentional convolutional neural network (LFACon) as an LFIQA metric. Our experimental results show that the proposed LFACon metric significantly outperforms the state-of-the-art LFIQA metrics. For the majority of distortion types, LFACon attains the best performance with lower complexity and less computational time.
翻译:光场成像可以捕获光线的强度信息和方向信息。它自然地带来六自由度观看体验和深度用户参与度的虚拟现实。与 2D 图像评估相比,光场图像质量评估(LFIQA)需要考虑空间域的图像质量和角度域的质量一致性。然而,缺少有效反映角度一致性和因此光场图像(LFI)角度质量的度量标准。此外,由于 LFI 的数据量过大,现有的 LFIQA 度量标准计算成本很高。本文通过在 LFI 的角度域引入多头自注意机制,提出了“角度注意力”的新概念,从而更好地反映了 LFI 的质量。特别地,我们提出了三个新的注意力卷积核,包括角度自注意力,角度网格注意力和角度中心注意力。这些注意力卷积核可以实现角度自注意力、全局或选择地提取多角度特征,并降低特征提取的计算成本。通过有效地整合所提出的卷积核,进一步提出了我们的光场注意卷积神经网络(LFACon)作为 LFIQA 度量标准。我们的实验结果表明,所提出的 LFACon 度量标准明显优于现有的 LFIQA 度量标准。对于大多数失真类型,LFACon 实现了更好的性能,复杂度更低,计算时间更少。