Recent visual odometry (VO) methods incorporating geometric algorithm into deep-learning architecture have shown outstanding performance on the challenging monocular VO task. Despite encouraging results are shown, previous methods ignore the requirement of generalization capability under noisy environment and various scenes. To address this challenging issue, this work first proposes a novel optical flow network (PANet). Compared with previous methods that predict optical flow as a direct regression task, our PANet computes optical flow by predicting it into the discrete position space with optical flow probability volume, and then converting it to optical flow. Next, we improve the bundle adjustment module to fit the self-supervised training pipeline by introducing multiple sampling, ego-motion initialization, dynamic damping factor adjustment, and Jacobi matrix weighting. In addition, a novel normalized photometric loss function is advanced to improve the depth estimation accuracy. The experiments show that the proposed system not only achieves comparable performance with other state-of-the-art self-supervised learning-based methods on the KITTI dataset, but also significantly improves the generalization capability compared with geometry-based, learning-based and hybrid VO systems on the noisy KITTI and the challenging outdoor (KAIST) scenes.
翻译:最近将几何算法纳入深层学习架构的视觉测量(VO)方法显示,在具有挑战性的单流VO任务方面表现出色。尽管取得了令人鼓舞的成果,但以往的方法忽略了在噪音环境和各种场景下普遍化能力的要求。为解决这一具有挑战性的问题,这项工作首先提出了新的光学流网络(panet ) 。与以前预测光流作为直接回归任务的方法相比,我们的PANet通过预测光流进入具有光流概率的离散位置空间,然后将其转换为光流,从而计算光流。 其次,我们通过采用多种取样、自我感动初始化、动态阻隔动因子调整和雅各比基矩阵加权,改进捆绑式调整模块,以适应自我监督的培训管道。此外,为了提高深度估算准确性,新颖的标准化光度损失功能被推进。实验显示,拟议的系统不仅在KITTI数据集上实现了与其他最先进的自上可监督的学习方法的可比较性能,而且还大大改进了与基于几何、基于学习和混合的越野空间系统相比的总体化能力。