The modern supervised approaches for human image relighting rely on training data generated from 3D human models. However, such datasets are often small (e.g., Light Stage data with a small number of individuals) or limited to diffuse materials (e.g., commercial 3D scanned human models). Thus, the human relighting techniques suffer from the poor generalization capability and synthetic-to-real domain gap. In this paper, we propose a two-stage method for single-image human relighting with domain adaptation. In the first stage, we train a neural network for diffuse-only relighting. In the second stage, we train another network for enhancing non-diffuse reflection by learning residuals between real photos and images reconstructed by the diffuse-only network. Thanks to the second stage, we can achieve higher generalization capability against various cloth textures, while reducing the domain gap. Furthermore, to handle input videos, we integrate illumination-aware deep video prior to greatly reduce flickering artifacts even with challenging settings under dynamic illuminations.
 翻译:现代人类图像光化监督方法依赖于3D人类模型产生的培训数据。然而,这类数据集往往规模小(例如,与少数个人共享的光相数据),或仅限于扩散材料(例如,商用3D扫描人类模型),因此,人类光化技术受到一般化能力差和合成到现实领域差距的困扰。在本文中,我们提出了单光人光化的两阶段方法,同时对域进行了适应。在第一阶段,我们培训了一个只进行扩散光化的神经网络。在第二阶段,我们培训了另一个网络,通过学习真实照片和通过扩散光化网络重建的图像之间的残留物来强化非渗透性反射。在第二阶段,我们可以针对各种布质纹物实现更高的普遍化能力,同时缩小域间差距。此外,为了处理输入视频,我们整合了有色的深层视频,然后在充满挑战的环境下大量减少闪烁的工艺品。