How would you fairly evaluate two multi-object tracking algorithms (i.e. trackers), each one employing a different object detector? Detectors keep improving, thus trackers can make less effort to estimate object states over time. Is it then fair to compare a new tracker employing a new detector with another tracker using an old detector? In this paper, we propose a novel performance measure, named Tracking Effort Measure (TEM), to evaluate trackers that use different detectors. TEM estimates the improvement that the tracker does with respect to its input data (i.e. detections) at frame level (intra-frame complexity) and sequence level (inter-frame complexity). We evaluate TEM over well-known datasets, four trackers and eight detection sets. Results show that, unlike conventional tracking evaluation measures, TEM can quantify the effort done by the tracker with a reduced correlation on the input detections. Its implementation is publicly available online at https://github.com/vpulab/MOT-evaluation.
翻译:如何公平地评估两种多物体跟踪算法( 即跟踪器)? 探测器不断改进, 从而跟踪器可以减少在时间上估计物体状态的努力。 那么, 使用旧探测器将使用新探测器的新追踪器与另一个追踪器比较是否公平? 在本文中, 我们提议了一个新的业绩计量, 名为跟踪 Efffort 度量法( TEM ), 以评价使用不同探测器的跟踪器。 TEM 估计了跟踪器在框架级别( 跨框架复杂度) 和序列级别( 跨框架复杂度) 输入数据( 检测) 方面的改进。 我们用众所周知的数据集、 4个跟踪器和8个检测器来评估TEM 。 结果显示, 与常规跟踪评估措施不同, TEM 可以量化跟踪器的工作, 减少输入检测的关联性。 其实施情况可在https://github. com/ vpulab/MOT- asserview 上公开查阅 。