3D reconstruction is a fundamental problem in computer vision, and the task is especially challenging when the object to reconstruct is partially or fully occluded. We introduce a method that uses the shadows cast by an unobserved object in order to infer the possible 3D volumes behind the occlusion. We create a differentiable image formation model that allows us to jointly infer the 3D shape of an object, its pose, and the position of a light source. Since the approach is end-to-end differentiable, we are able to integrate learned priors of object geometry in order to generate realistic 3D shapes of different object categories. Experiments and visualizations show that the method is able to generate multiple possible solutions that are consistent with the observation of the shadow. Our approach works even when the position of the light source and object pose are both unknown. Our approach is also robust to real-world images where ground-truth shadow mask is unknown.
翻译:3D 重建是计算机视觉中的一个基本问题,当重建对象部分或完全隐蔽时,任务尤其具有挑战性。我们引入了一种方法,使用未观测对象投下的阴影来推断隐蔽后可能的 3D 体积。我们创建了一种不同的图像形成模型,使我们能够共同推断一个对象的 3D 形状、其外形和光源的位置。由于该方法是端到端可变的,因此我们能够将所学到的物体几何学前科整合起来,以便产生不同对象类别的现实的 3D 形状。实验和可视化显示该方法能够产生与观察阴影一致的多种可能的解决方案。我们的方法即使在光源和物体的定位都未知的情况下也起作用。我们的方法对于地面图象蒙面未知的真实世界图像也很有力。