神经重新预测错误: 合并地貌学习和相机波斯估计 (Neural Reprojection Error: Merging Feature Learning and Camera Pose Estimation)

Absolute camera pose estimation is usually addressed by sequentially solving two distinct subproblems: First a feature matching problem that seeks to establish putative 2D-3D correspondences, and then a Perspective-n-Point problem that minimizes, with respect to the camera pose, the sum of so-called Reprojection Errors (RE). We argue that generating putative 2D-3D correspondences 1) leads to an important loss of information that needs to be compensated as far as possible, within RE, through the choice of a robust loss and the tuning of its hyperparameters and 2) may lead to an RE that conveys erroneous data to the pose estimator. In this paper, we introduce the Neural Reprojection Error (NRE) as a substitute for RE. NRE allows to rethink the camera pose estimation problem by merging it with the feature learning problem, hence leveraging richer information than 2D-3D correspondences and eliminating the need for choosing a robust loss and its hyperparameters. Thus NRE can be used as training loss to learn image descriptors tailored for pose estimation. We also propose a coarse-to-fine optimization method able to very efficiently minimize a sum of NRE terms with respect to the camera pose. We experimentally demonstrate that NRE is a good substitute for RE as it significantly improves both the robustness and the accuracy of the camera pose estimate while being computationally and memory highly efficient. From a broader point of view, we believe this new way of merging deep learning and 3D geometry may be useful in other computer vision applications.

翻译：绝对摄像头的表面估计通常通过按顺序解决两个截然不同的子问题来解决:首先,一个特征匹配问题,寻求建立2D-3D类假设通信,然后是透视点问题,最大限度地减少相机的表面,即所谓的重新预测错误(RE)的总和。我们争辩说,产生2D-3D类假设通信,1 导致信息的重大损失,需要尽可能在可再生能源内部通过选择稳健的损失和调整其超参数和2来补偿。可能导致一个错误数据传递给配置估计器的RE。在本文件中,我们引入神经再预测错误(NRE),以替代RE。 NRE允许重新思考相机造成的估计问题,将它与特征学习问题结合起来,从而利用比2D-3D类更丰富的信息,并消除选择稳健损失及其超参数的需要。因此,NRE可以用作培训损失,以学习为估计而定制的深度图像描述器。我们还建议从隐蔽点到更精确的图像校正应用,同时我们提议从隐蔽点选择一个高清晰的准确度的精确度的图像,同时以高精度的精确度来展示一个高度的精确度的精确度的图像。