Many mobile manufacturers recently have adopted Dual-Pixel (DP) sensors in their flagship models for faster auto-focus and aesthetic image captures. Despite their advantages, research on their usage for 3D facial understanding has been limited due to the lack of datasets and algorithmic designs that exploit parallax in DP images. This is because the baseline of sub-aperture images is extremely narrow and parallax exists in the defocus blur region. In this paper, we introduce a DP-oriented Depth/Normal network that reconstructs the 3D facial geometry. For this purpose, we collect a DP facial data with more than 135K images for 101 persons captured with our multi-camera structured light systems. It contains the corresponding ground-truth 3D models including depth map and surface normal in metric scale. Our dataset allows the proposed matching network to be generalized for 3D facial depth/normal estimation. The proposed network consists of two novel modules: Adaptive Sampling Module and Adaptive Normal Module, which are specialized in handling the defocus blur in DP images. Finally, the proposed method achieves state-of-the-art performances over recent DP-based depth/normal estimation methods. We also demonstrate the applicability of the estimated depth/normal to face spoofing and relighting.
翻译:许多移动制造商最近在其旗舰模型中采用了双像(DP)传感器,以便更快的自动聚焦和美学图像捕捉。尽管这些传感器具有优势,但关于3D面部理解的DP-Pixel(DP)传感器研究有限,因为缺乏利用DP图像中的parllax的数据集和算法设计。这是因为次孔图的基线非常狭窄,在偏角模糊的区域存在parallax。在本文件中,我们引入了一个以DP为导向的深度/低温网络,以重建3D面部几何学。为此目的,我们收集了101人用多摄像头结构灯系统摄取的DP-135K图像的DP面部数据。它包含相应的地面3D模型,包括深度图和标准尺度表层正常度。我们的数据集使得拟议的匹配网络能够用于3D面部面部深度/正常度估计。拟议网络由两个新的模块组成:适应性抽样模块和适应性常态常态模块,专门处理DP图像中脱焦度模糊部分。最后,拟议方法还实现了最新DP/正常深度估计的状态。