Object tracking is the cornerstone of many visual analytics systems. While considerable progress has been made in this area in recent years, robust, efficient, and accurate tracking in real-world video remains a challenge. In this paper, we present a hybrid tracker that leverages motion information from the compressed video stream and a general-purpose semantic object detector acting on decoded frames to construct a fast and efficient tracking engine suitable for a number of visual analytics applications. The proposed approach is compared with several well-known recent trackers on the OTB tracking dataset. The results indicate advantages of the proposed method in terms of speed and/or accuracy. Another advantage of the proposed method over most existing trackers is its simplicity and deployment efficiency, which stems from the fact that it reuses and re-purposes the resources and information that may already exist in the system for other reasons.
翻译:物体跟踪是许多视觉分析系统的基石。 虽然近年来在这一领域取得了相当大的进展,但现实世界视频的可靠、高效和准确跟踪仍然是一个挑战。在本文件中,我们提出了一个混合跟踪器,利用压缩视频流的移动信息,并使用一个通用语义物体探测器在解码框架上采取行动,以建立一个适合若干视觉分析应用的快速高效跟踪引擎。提议的方法与几个众所周知的近期 OTB 跟踪数据集的跟踪器进行了比较。结果显示拟议方法在速度和/或准确性方面的优势。相对于大多数现有跟踪器而言,拟议方法的另一个优势是其简单和部署效率,其原因是它重新利用和重新利用了系统中可能存在的其他原因的资源和信息。