This paper introduces pixel-wise prediction based visual odometry (PWVO), which is a dense prediction task that evaluates the values of translation and rotation for every pixel in its input observations. PWVO employs uncertainty estimation to identify the noisy regions in the input observations, and adopts a selection mechanism to integrate pixel-wise predictions based on the estimated uncertainty maps to derive the final translation and rotation. In order to train PWVO in a comprehensive fashion, we further develop a data generation workflow for generating synthetic training data. The experimental results show that PWVO is able to deliver favorable results. In addition, our analyses validate the effectiveness of the designs adopted in PWVO, and demonstrate that the uncertainty maps estimated by PWVO is capable of capturing the noises in its input observations.
翻译:本文介绍了以像素预测为基础的视觉观察测量(PWVO),这是一项密集的预测任务,它评估了投入观测中每个像素的翻译和旋转值。PWVO使用不确定性估计来查明输入观测中的噪音区域,并采用一个选择机制,根据估计的不确定地图将像素预测综合起来,以得出最后翻译和旋转。为了全面培训PWVO,我们进一步开发了一个生成合成培训数据的数据收集工作流程。实验结果显示,PWVO能够产生有利的结果。此外,我们的分析证实PWVO所采用的设计的有效性,并表明PWVO估计的不确定地图能够捕捉其输入观测中的噪音。