Expanding an existing tourist photo from a partially captured scene to a full scene is one of the desired experiences for photography applications. Although photo extrapolation has been well studied, it is much more challenging to extrapolate a photo (i.e., selfie) from a narrow field of view to a wider one while maintaining a similar visual style. In this paper, we propose a factorized neural re-rendering model to produce photorealistic novel views from cluttered outdoor Internet photo collections, which enables the applications including controllable scene re-rendering, photo extrapolation and even extrapolated 3D photo generation. Specifically, we first develop a novel factorized re-rendering pipeline to handle the ambiguity in the decomposition of geometry, appearance and illumination. We also propose a composited training strategy to tackle the unexpected occlusion in Internet images. Moreover, to enhance photo-realism when extrapolating tourist photographs, we propose a novel realism augmentation process to complement appearance details, which automatically propagates the texture details from a narrow captured photo to the extrapolated neural rendered image. The experiments and photo editing examples on outdoor scenes demonstrate the superior performance of our proposed method in both photo-realism and downstream applications.
翻译:将现有旅游照片从部分摄取的场景扩展为全场是摄影应用的预期经验之一。虽然对照片外推法进行了仔细研究,但将照片(即自相)从狭窄的视野领域外推至更宽的视野领域,同时保持类似的视觉风格,则更具挑战性得多。在本文中,我们提出了一个因子化神经再演化模型,以产生由封闭的户外互联网照片收藏的摄影现实主义新观点,使这些应用能够包括可控场景再演、照片外推甚至外推3D照片生成。具体地说,我们首先开发了一个新的因子再演化管道,以处理地理测量、外观和光照的分解的模糊性。我们还提出了一个综合培训战略,以解决互联网图像中出乎意料的隐蔽性。此外,为了在对游客照片进行外推时加强摄影现实主义,我们提出了一个新的真实性增强过程,以补充外观细节,将文字细节从狭窄的拍摄照片到外推变现的内光成图像。我们所提出的摄影和下游图像应用的高级性演示和摄影演化方法展示。