We consider the challenging problem of outdoor lighting estimation for the goal of photorealistic virtual object insertion into photographs. Existing works on outdoor lighting estimation typically simplify the scene lighting into an environment map which cannot capture the spatially-varying lighting effects in outdoor scenes. In this work, we propose a neural approach that estimates the 5D HDR light field from a single image, and a differentiable object insertion formulation that enables end-to-end training with image-based losses that encourage realism. Specifically, we design a hybrid lighting representation tailored to outdoor scenes, which contains an HDR sky dome that handles the extreme intensity of the sun, and a volumetric lighting representation that models the spatially-varying appearance of the surrounding scene. With the estimated lighting, our shadow-aware object insertion is fully differentiable, which enables adversarial training over the composited image to provide additional supervisory signal to the lighting prediction. We experimentally demonstrate that our hybrid lighting representation is more performant than existing outdoor lighting estimation methods. We further show the benefits of our AR object insertion in an autonomous driving application, where we obtain performance gains for a 3D object detector when trained on our augmented data.
翻译:我们考虑了户外照明估计在照片中插入符合现实的虚拟物体的目标方面具有挑战性的问题。现有的户外照明估计工作通常将现场照明简化为环境图,无法捕捉户外场景中空间变化的照明效应。在这项工作中,我们提出了一个神经学方法,从单一图像中估算5DHD光场,以及一个不同的物体插入配方,以便能够进行端对端培训,使其了解基于图像的损失,从而鼓励现实主义。具体地说,我们设计了一个适合户外场景的混合照明代表面,其中包含一个处理太阳极端强度的《人类发展报告》天空圆顶,以及一个能模拟周围场景空间变化外观的体积照明代表面。由于估计照明,我们的影子觉物体插入完全可以截然不同,使得对复合图像的对抗性培训能够为照明预测提供更多的监督信号。我们实验性地证明,我们的混合照明代表比现有的户外照明估计方法更能表现。我们进一步展示了在自主驾驶应用程序中插入我们的AR物体的好处,我们在那里获得3D物体探测器在强化数据培训时的性工作成果。