Accurate and robust visual object tracking is one of the most challenging and fundamental computer vision problems. It entails estimating the trajectory of the target in an image sequence, given only its initial location, and segmentation, or its rough approximation in the form of a bounding box. Discriminative Correlation Filters (DCFs) and deep Siamese Networks (SNs) have emerged as dominating tracking paradigms, which have led to significant progress. Following the rapid evolution of visual object tracking in the last decade, this survey presents a systematic and thorough review of more than 90 DCFs and Siamese trackers, based on results in nine tracking benchmarks. First, we present the background theory of both the DCF and Siamese tracking core formulations. Then, we distinguish and comprehensively review the shared as well as specific open research challenges in both these tracking paradigms. Furthermore, we thoroughly analyze the performance of DCF and Siamese trackers on nine benchmarks, covering different experimental aspects of visual tracking: datasets, evaluation metrics, performance, and speed comparisons. We finish the survey by presenting recommendations and suggestions for distinguished open challenges based on our analysis.
翻译:精确和稳健的视觉物体跟踪是计算机视觉视觉视觉视觉最困难和最根本的问题之一,它需要根据最初位置和分解,或以捆绑盒的形式粗近似,以图像序列估计目标的轨迹,仅考虑到其最初位置和分解,或其粗略近似。相异的关联过滤器(DCFs)和深浅的暹马斯网络(SNS)已成为主导性跟踪模式,从而导致重大的进展。在过去十年视觉物体跟踪的快速演变之后,本次调查根据9个跟踪基准的结果,对90多个DCFs和Siamsese跟踪器进行了系统、彻底的审查。首先,我们介绍了DCF和Siams跟踪核心配方的背景理论。然后,我们区分并全面审查了这两个跟踪模式的共同和具体的公开研究挑战。此外,我们透彻分析了DCF和Siams跟踪器在9个基准上的绩效,涵盖视觉跟踪的不同实验方面:数据集、评价指标、性能和速度比较。我们完成调查,通过根据我们的分析,提出不同公开挑战的建议和建议来完成调查。