In this paper, an essential problem of robust visual odometry (VO) is approached by incorporating geometry-based methods into deep-learning architecture in a self-supervised manner. Generally, pure geometry-based algorithms are not as robust as deep learning in feature-point extraction and matching, but perform well in ego-motion estimation because of their well-established geometric theory. In this work, a novel optical flow network (PANet) built on a position-aware mechanism is proposed first. Then, a novel system that jointly estimates depth, optical flow, and ego-motion without a typical network to learning ego-motion is proposed. The key component of the proposed system is an improved bundle adjustment module containing multiple sampling, initialization of ego-motion, dynamic damping factor adjustment, and Jacobi matrix weighting. In addition, a novel relative photometric loss function is advanced to improve the depth estimation accuracy. The experiments show that the proposed system not only outperforms other state-of-the-art methods in terms of depth, flow, and VO estimation among self-supervised learning-based methods on KITTI dataset, but also significantly improves robustness compared with geometry-based, learning-based and hybrid VO systems. Further experiments show that our model achieves outstanding generalization ability and performance in challenging indoor (TMU-RGBD) and outdoor (KAIST) scenes.
翻译:本文以自我监督的方式将基于几何方法的几何方法纳入深层学习架构,从而解决了稳健视觉计量学(VO)的基本问题。一般而言,纯粹几何算法在特征点提取和匹配方面不如深层学习强,而是在自我感化评估方面表现良好,因为其具有完善的几何理论。在这项工作中,首先提出了一个基于定位认知机制的新颖的光学流动网络(Panet)建议。随后,提出了一个新的系统,在没有典型的学习自我感动网络的情况下,共同估计深度、光学流动和自我感动。拟议系统的关键组成部分是改进的捆绑式调整模块,包含多重抽样、自我感动初始化、动态阻力要素调整和雅各比矩阵加权等内容。此外,一个新的相对光度损失功能正在推进,以提高深度估算准确性。实验表明,拟议的系统不仅在深度、流动和自上超前的基于内部智能的学习方法(KITTITI数据集的模型和基于内部空间空间空间数据系统)之间超越了其他最新的最新方法。此外,还大大改进了我们具有挑战性的实地空间空间实验的能力。