Single object tracking (SOT) is currently one of the most important tasks in computer vision. With the development of the deep network and the release for a series of large scale datasets for single object tracking, siamese networks have been proposed and perform better than most of the traditional methods. However, recent siamese networks get deeper and slower to obtain better performance. Most of these methods could only meet the needs of real-time object tracking in ideal environments. In order to achieve a better balance between efficiency and accuracy, we propose a simpler siamese network for single object tracking, which runs fast in poor hardware configurations while remaining an excellent accuracy. We use a more efficient regression method to compute the location of the tracked object in a shorter time without losing much precision. For improving the accuracy and speeding up the training progress, we introduce the Squeeze-and-excitation (SE) network into the feature extractor. In this paper, we compare the proposed method with some state-of-the-art trackers and analysis their performances. Using our method, a siamese network could be trained with shorter time and less data. The fast processing speed enables combining object tracking with object detection or other tasks in real time.
翻译:目前,单一对象跟踪(SOT)是计算机视觉中最重要的任务之一。随着深网络的开发,以及一系列大型数据集的发布,用于单一物体跟踪的大规模数据集系列的发布,我们提出了比大多数传统方法更好的系统网络,而且网络的运作比大多数传统方法要好。然而,最近的Siames网络越来越深入和慢,以取得更好的性能。这些方法大多只能满足在理想环境中实时物体跟踪的需要。为了更好地平衡效率和准确性,我们建议为单个物体跟踪建立一个更简单的 Siamese网络,这个网络在硬件配置不佳的情况下运行速度很快,同时保持极佳的准确性。我们使用一种更高效的回归方法,在较短的时间内计算被跟踪对象的位置,而不失去很多精确性。为了提高准确性和加快培训进度,我们将Squeze-Excistration (Se) 网络引入了特征提取器。在本文中,我们比较了拟议方法与某些最先进的跟踪器进行比较,并分析其性能。使用我们的方法,一个Siamesee网络可以用较短的时间和较慢的数据来训练。快速地处理其他物体的跟踪速度。