Visual target tracking is one of the most sought-after yet challenging research topics in computer vision. Given the ill-posed nature of the problem and its popularity in a broad range of real-world scenarios, a number of large-scale benchmark datasets have been established, on which considerable methods have been developed and demonstrated with significant progress in recent years -- predominantly by recent deep learning (DL)-based methods. This survey aims to systematically investigate the current DL-based visual tracking methods, benchmark datasets, and evaluation metrics. It also extensively evaluates and analyzes the leading visual tracking methods. First, the fundamental characteristics, primary motivations, and contributions of DL-based methods are summarized from nine key aspects of: network architecture, network exploitation, network training for visual tracking, network objective, network output, exploitation of correlation filter advantages, aerial-view tracking, long-term tracking, and online tracking. Second, popular visual tracking benchmarks and their respective properties are compared, and their evaluation metrics are summarized. Third, the state-of-the-art DL-based methods are comprehensively examined on a set of well-established benchmarks of OTB2013, OTB2015, VOT2018, LaSOT, UAV123, UAVDT, and VisDrone2019. Finally, by conducting critical analyses of these state-of-the-art trackers quantitatively and qualitatively, their pros and cons under various common scenarios are investigated. It may serve as a gentle use guide for practitioners to weigh when and under what conditions to choose which method(s). It also facilitates a discussion on ongoing issues and sheds light on promising research directions.
翻译:视觉目标跟踪是计算机视野中最受关注但最具挑战性的研究课题之一;鉴于这一问题的性质欠佳,在广泛的现实世界情景中受到广泛欢迎,已经建立了许多大型基准数据集,并在此基础上开发了相当多方法,近年来取得了显著进展,主要是最近深层学习(DL)方法;调查的目的是系统调查目前基于DL的视觉跟踪方法、基准数据集和评价指标;还广泛评价和分析主要视觉跟踪方法;第一,从以下九个关键方面总结了基于DL的方法的基本特点、主要动机和贡献:网络结构、网络开发、视觉跟踪网络培训、网络目标、网络输出、利用相关过滤优势、空中观察跟踪、长期跟踪和在线跟踪;第二,对大众视觉跟踪基准及其各自的属性进行了比较,并总结了评价指标;第三,根据OTB2013、OTB2015和以DL为主的情景选择方法,对OVA-DR的常规基准进行了全面审查;第二,对UTB-2015、VOT18和V-DL的常规分析,这些基准用于这些常规分析的常规分析。