Digital humans are attracting more and more research interest during the last decade, the generation, representation, rendering, and animation of which have been put into large amounts of effort. However, the quality assessment of digital humans has fallen behind. Therefore, to tackle the challenge of digital human quality assessment issues, we propose the first large-scale quality assessment database for three-dimensional (3D) scanned digital human heads (DHHs). The constructed database consists of 55 reference DHHs and 1,540 distorted DHHs along with the subjective perceptual ratings. Then, a simple yet effective full-reference (FR) projection-based method is proposed to evaluate the visual quality of DHHs. The pretrained Swin Transformer tiny is employed for hierarchical feature extraction and the multi-head attention module is utilized for feature fusion. The experimental results reveal that the proposed method exhibits state-of-the-art performance among the mainstream FR metrics, which can provide an effective FR-IQA index for DHHs.
翻译:数字人类在过去十年中吸引了越来越多的研究兴趣,其生成、表述、制作和动画都投入了大量精力。然而,数字人类的质量评估已经落后。因此,为了应对数字人类质量评估问题的挑战,我们提议为三维(3D)扫描的数字人类头目(DHHs)建立第一个大规模质量评估数据库。建立的数据库由55个参考DHHs和1 540个扭曲的DHS以及主观概念评级组成。然后,提出一个简单而有效的全参照(FR)预测法,以评价DHS的视觉质量。预先训练的Swin变形器小用于分级特征提取,多头关注模块用于特征聚合。实验结果显示,拟议方法展示了主流FR指标中的最新性能,可为DHS提供有效的FR-IQA索引。