High-precision camera re-localization technology in a pre-established 3D environment map is the basis for many tasks, such as Augmented Reality, Robotics and Autonomous Driving. The point-based visual re-localization approaches are well-developed in recent decades, but are insufficient in some feature-less cases. In this paper, we design a complete pipeline for camera pose refinement with points and lines, which contains the innovatively designed line extracting CNN named VLSE, the line matching and the pose optimization approaches. We adopt a novel line representation and customize a hybrid convolution block based on the Stacked Hourglass network, to detect accurate and stable line features on images. Then we apply a geometric-based strategy to obtain precise 2D-3D line correspondences using epipolar constraint and reprojection filtering. A following point-line joint cost function is constructed to optimize the camera pose with the initial coarse pose from the pure point-based localization. Sufficient experiments are conducted on open datasets, i.e, line extractor on Wireframe and YorkUrban, localization performance on InLoc duc1 and duc2, to confirm the effectiveness of our point-line joint pose optimization method.
翻译:在预先建立的 3D 环境地图中,高精度摄像头重新定位技术是许多任务的基础,例如增强现实、机器人和自动驱动等任务的基础。近几十年来,基于点的视觉重新定位方法得到了完善,但在一些没有特色的案例中还不够。在本文中,我们设计了一个完整的照相管道,用点和线进行精细的改进,其中包括创新设计的线条提取CNN的名为VLSE、线匹配和成形优化方法。我们采用了一种新型的线条代表制,并定制了一个以堆叠式玻璃网络为基础的混合卷积块,以探测图像上的准确和稳定的线性特征。然后,我们运用基于几何测基的战略,利用表面限制和再预测过滤法获得精确的 2D-3D 线对应。下面的点联线成本功能是用纯点定位本地化的最初粗糙的布局来优化相机的布局。我们对开放的数据集进行了充分的实验,即对Wreframy和约克Urban网络的线提取器和约克Urban网络进行充分的实验,以确认我们的优化联合定位方法的本地化。