We present 360-DFPE, a sequential floor plan estimation method that directly takes 360-images as input without relying on active sensors or 3D information. Our approach leverages a loosely coupled integration between a monocular visual SLAM solution and a monocular 360-room layout approach, which estimate camera poses and layout geometries, respectively. Since our task is to sequentially capture the floor plan using monocular images, the entire scene structure, room instances, and room shapes are unknown. To tackle these challenges, we first handle the scale difference between visual odometry and layout geometry via formulating an entropy minimization process, which enables us to directly align 360-layouts without knowing the entire scene in advance. Second, to sequentially identify individual rooms, we propose a novel room identification algorithm that tracks every room along the camera exploration using geometry information. Lastly, to estimate the final shape of the room, we propose a shortest path algorithm with an iterative coarse-to-fine strategy, which improves prior formulations with higher accuracy and faster run-time. Moreover, we collect a new floor plan dataset with challenging large-scale scenes, providing both point clouds and sequential 360-image information. Experimental results show that our monocular solution achieves favorable performance against the current state-of-the-art algorithms that rely on active sensors and require the entire scene reconstruction data in advance.
翻译:我们提出360-DFPE,这是一套连续的地面规划估算方法,在不依赖主动传感器或3D信息的情况下,直接将360幅图像作为输入。我们的方法在单视视觉SLAM解决方案和单视360室布局方法之间拉动了松散的结合,分别对摄像头配置和布局几何进行估计。由于我们的任务是使用单视图像、整个场景结构、房间实例和房间形状来按顺序拍摄地板图。为了应对这些挑战,我们首先通过开发一个最小化诱导过程来处理视觉视像测量和布局几何测量之间的比例差异,这使我们能够直接将360幅布局与整个场景联系起来。第二,为了按顺序确定单个房间,我们建议采用新颖的房间鉴定算法,利用几何测量信息跟踪每个房间的形状。最后,我们提出一个使用高超偏差的直径直径直到直径方的战略,用更精确和更快的运行时间来改进先前的形状。此外,我们还收集一个新的地面规划数据集,在具有挑战性的大型场景场面图上直接定位,同时提供正向式的轨道定位的图像分析结果。