Face recognition can benefit from the utilization of depth data captured using low-cost cameras, in particular for presentation attack detection purposes. Depth video output from these capture devices can however contain defects such as holes or general depth inaccuracies. This work proposes a deep learning face depth enhancement method in this context of facial biometrics, which adds a security aspect to the topic. U-Net-like architectures are utilized, and the networks are compared against hand-crafted enhancer types, as well as a similar depth enhancer network from related work trained for an adjacent application scenario. All tested enhancer types exclusively use depth data as input, which differs from methods that enhance depth based on additional input data such as visible light color images. Synthetic face depth ground truth images and degraded forms thereof are created with help of PRNet, to train multiple deep learning enhancer models with different network sizes and training configurations. Evaluations are carried out on the synthetic data, on Kinect v1 images from the KinectFaceDB, and on in-house RealSense D435 images. These evaluations include an assessment of the falsification for occluded face depth input, which is relevant to biometric security. The proposed deep learning enhancers yield noticeably better results than the tested preexisting enhancers, without overly falsifying depth data when non-face input is provided, and are shown to reduce the error of a simple landmark-based PAD method.
翻译:利用低成本照相机采集的深度数据,特别是为了显示攻击探测的目的,可以对面部进行面部识别。这些捕获装置的深度视频输出可能包含孔或一般深度不准确性等缺陷。这项工作提议在面部生物鉴别学背景下采用深深学习面部强化方法,这增加了这个专题的安全方面。利用了U-Net类结构,将网络与手工制作的增强型号进行比较,并比对相邻应用情景培训的相关工作进行类似的深度增强网络。所有测试过的增强型都专门使用深度数据作为输入,这不同于根据可见光彩色图像等额外输入数据而提高深度的方法。在PRNet的帮助下,合成面部面部深度地面真相图像及其退化的形式得以创建,以培训具有不同网络规模和培训配置的多个深层强化型模型。对合成数据、Kinect V1型图像和内部RealSense D435型图像进行了比较。这些评价包括评估对隐蔽面深度输入的伪造数据,这些方法不同于可见面深度图像。在不进行深度测试的情况下,为更精确的深度数据更新了预测度。