We present an approach for the planar surface reconstruction of a scene from images with limited overlap. This reconstruction task is challenging since it requires jointly reasoning about single image 3D reconstruction, correspondence between images, and the relative camera pose between images. Past work has proposed optimization-based approaches. We introduce a simpler approach, the PlaneFormer, that uses a transformer applied to 3D-aware plane tokens to perform 3D reasoning. Our experiments show that our approach is substantially more effective than prior work, and that several 3D-specific design decisions are crucial for its success.
 翻译:我们提出了一个从图像上用有限重叠的图像对场景进行平面重建的方法。 这一重建任务具有挑战性,因为它需要共同推理单一图像 3D 重建,图像之间的通信,图像之间的相对相机之间的图像。过去的工作提出了以优化为基础的方法。我们引入了一个更简单的方法,即“平板”,即3D-Wather平面标牌应用变压器来进行3D 推理。我们的实验表明,我们的方法比以前的工作要有效得多,而几个三D具体设计决定对于它的成功至关重要。