Feature tracking is the building block of many applications such as visual odometry, augmented reality, and target tracking. Unfortunately, the state-of-the-art vision-based tracking algorithms fail in surgical images due to the challenges imposed by the nature of such environments. In this paper, we proposed a novel and unified deep learning-based approach that can learn how to track features reliably as well as learn how to detect such reliable features for tracking purposes. The proposed network dubbed as Deep-PT, consists of a tracker network which is a convolutional neural network simulating cross-correlation in terms of deep learning and two fully connected networks that operate on the output of intermediate layers of the tracker to detect features and predict trackability of the detected points. The ability to detect features based on the capabilities of the tracker distinguishes the proposed method from previous algorithms used in this area and improves the robustness of the algorithms against dynamics of the scene. The network is trained using multiple datasets due to the lack of specialized dataset for feature tracking datasets and extensive comparisons are conducted to compare the accuracy of Deep-PT against recent pixel tracking algorithms. As the experiments suggest, the proposed deep architecture deliberately learns what to track and how to track and outperforms the state-of-the-art methods.
翻译:不幸的是,最先进的视觉跟踪算法在外科图像中由于这种环境的性质所带来的挑战而未能在外科图像中发挥作用。在本文件中,我们提议了一种新型和统一的深层次学习基础方法,可以学习如何可靠地跟踪特征,并学习如何为跟踪目的探测这类可靠特征。拟议的网络称为深点跟踪,由跟踪网络组成,这是一个跟踪网络,是一个动态神经网络,它模拟了深层学习方面的交叉关系,两个完全连接的网络,在跟踪器中间层的产出上运行,以探测所探测到的点的特征并预测其可追踪性。根据跟踪器的能力,检测所拟议的方法与该地区以前使用的算法有区别,并改进算法与现场动态的稳健性。由于缺少用于跟踪特征数据集的专门数据集和广泛比较,对网络进行了培训,以比较深点跟踪的准确性与最近测点的跟踪方法,以及拟议跟踪的跟踪方法与最近测距的跟踪方法如何比较。