Standardized benchmarks have been crucial in pushing the performance of computer vision algorithms, especially since the advent of deep learning. Although leaderboards should not be over-claimed, they often provide the most objective measure of performance and are therefore important guides for research. We present MOTChallenge, a benchmark for single-camera Multiple Object Tracking (MOT) launched in late 2014, to collect existing and new data, and create a framework for the standardized evaluation of multiple object tracking methods. The benchmark is focused on multiple people tracking, since pedestrians are by far the most studied object in the tracking community, with applications ranging from robot navigation to self-driving cars. This paper collects the first three releases of the benchmark: (i) MOT15, along with numerous state-of-the-art results that were submitted in the last years, (ii) MOT16, which contains new challenging videos, and (iii) MOT17, that extends MOT16 sequences with more precise labels and evaluates tracking performance on three different object detectors. The second and third release not only offers a significant increase in the number of labeled boxes but also provide labels for multiple object classes beside pedestrians, as well as the level of visibility for every single object of interest. We finally provide a categorization of state-of-the-art trackers and a broad error analysis. This will help newcomers understand the related work and research trends in the MOT community, and hopefully shred some light into potential future research directions.
翻译:标准化基准对于推动计算机愿景算法的绩效至关重要,特别是自深层次学习以来,在推进计算机愿景算法的绩效方面至关重要。虽然领导板不应该被过分夸大,但它们往往提供最客观的业绩衡量标准,因此是重要的研究指南。我们介绍了2014年底推出的单相机多物体跟踪(MOT)基准MTCHallenge,以收集现有和新的数据,并为多物体跟踪方法的标准化评价建立一个框架。基准侧重于多人的跟踪,因为行人是跟踪社区中研究最多的对象,其应用范围从机器人导航到自驾驶汽车不等。本文收集了基准头三个版本的首三个版本:(一) MOT15,以及过去几年中提交的许多最新成果;(二) MOT16,其中载有新的具有挑战性的视频,以及(三) MOT17,以更精确的标签扩展了MOT16序列,并评估了三个不同物体探测器的运行情况。第二版和第三版的发布不仅显著增加了标注目标箱的数量,而且还为多类目标未来研究的清晰度提供了标签,最终提供了我们所了解的一行距的一行距的一行距,并且提供了我们所了解的一行距的一行距的一行距的清晰度。