The ability of accurate depth prediction by a convolutional neural network (CNN) is a major challenge for its wide use in practical visual simultaneous localization and mapping (SLAM) applications, such as enhanced camera tracking and dense mapping. This paper is set out to answer the following question: Can we tune a depth prediction CNN with the help of a visual SLAM algorithm even if the CNN is not trained for the current operating environment in order to benefit the SLAM performance? To this end, we propose a novel online adaptation framework consisting of two complementary processes: a SLAM algorithm that is used to generate keyframes to fine-tune the depth prediction and another algorithm that uses the online adapted depth to improve map quality. Once the potential noisy map points are removed, we perform global photometric bundle adjustment (BA) to improve the overall SLAM performance. Experimental results on both benchmark datasets and a real robot in our own experimental environments show that our proposed method improves the overall SLAM accuracy. While regularization has been shown to be effective in multi-task classification problems, we present experimental results and an ablation study to show the effectiveness of regularization in preventing catastrophic forgetting in the online adaptation of depth prediction, a single-task regression problem. In addition, we compare our online adaptation framework against the state-of-the-art pre-trained depth prediction CNNs to show that our online adapted depth prediction CNN outperforms the depth prediction CNNs that have been trained on a large collection of datasets.
翻译:由连锁神经网络(CNN)进行准确深度预测的能力是一个重大挑战,因为它广泛用于实际视觉同步本地化和绘图(SLAM)应用,例如强化相机跟踪和密集绘图等实际同步本地化和绘图(SLAM)应用,这是对其广泛使用的关键框架的一大挑战。本文件旨在回答以下问题:我们能否借助视觉SLAM算法,对CNN进行深度预测,即使CNN没有为当前运行环境而接受过有利于SLAM绩效的培训;为此,我们提议了一个由两个互补程序组成的新的在线适应框架:一个SLAM算法,用来生成关键框架,以微调深度预测和利用在线调整深度改进地图质量的另一种算法。一旦潜在的扰动地图点被删除,我们就进行全球光度捆绑调整(BA),以改善SLMM总体绩效。关于基准数据集和我们自己实验环境中的真正机器人的实验结果显示,我们提出的方法提高了SLMMM的总体准确性。尽管正规化在多重任务分类问题上是有效的,但我们提出了实验性结果,另一个算法研究,以显示正规化的实效,以显示在防止灾难性的深度预测中出现在线深度预测,我们在一次深度预测之前的升级。