Visual object tracking is an important task in computer vision, which has many real-world applications, e.g., video surveillance, visual navigation. Visual object tracking also has many challenges, e.g., object occlusion and deformation. To solve above problems and track the target accurately and efficiently, many tracking algorithms have emerged in recent years. This paper presents the rationale and representative works of two most popular tracking frameworks in past ten years, i.e., the corelation filter and Siamese network for object tracking. Then we present some deep learning based tracking methods categorized by different network structures. We also introduce some classical strategies for handling the challenges in tracking problem. Further, this paper detailedly present and compare the benchmarks and challenges for tracking, from which we summarize the development history and development trend of visual tracking. Focusing on the future development of object tracking, which we think would be applied in real-world scenes before some problems to be addressed, such as the problems in long-term tracking, low-power high-speed tracking and attack-robust tracking. In the future, the integration of multimodal data, e.g., the depth image, thermal image with traditional color image, will provide more solutions for visual tracking. Moreover, tracking task will go together with some other tasks, e.g., video object detection and segmentation.
翻译:视觉物体跟踪是计算机视觉中的一项重要任务,它有许多现实世界应用,例如视频监视、视觉导航等。视觉物体跟踪也有许多挑战,例如物体隔离和变形。为了解决上述问题并准确和有效地跟踪目标,近年来出现了许多跟踪算法。本文介绍了过去十年中两个最受欢迎的跟踪框架的理由和代表性工作,即核心过滤器和Siamese天体跟踪网络。然后我们介绍了一些由不同网络结构分类的深层次学习跟踪方法。我们还介绍了一些处理追踪问题挑战的典型战略。此外,本文详细介绍并比较了跟踪的基准和挑战,我们从中总结了视觉跟踪的发展历史和发展趋势。侧重于未来天体跟踪的发展,我们认为,在需要解决一些问题之前,这两个框架将应用到真实世界的场景中,例如,核心过滤器过滤器和Siamse网络跟踪。未来,将采用一些多式数据集成,例如深度图像、热路路段跟踪任务,以及传统图像跟踪任务。将提供一些解决方案,例如深度图像跟踪、热路段跟踪等。