Multi-point tracking is a challenging task that involves detecting points in the scene and tracking them across a sequence of frames. Computing detection-based measures like the F-measure on a frame-by-frame basis is not sufficient to assess the overall performance, as it does not interpret performance in the temporal domain. The main evaluation metric available comes from Multi-object tracking (MOT) methods to benchmark performance on datasets such as KITTI with the recently proposed higher order tracking accuracy (HOTA) metric, which is capable of providing a better description of the performance over metrics such as MOTA, DetA, and IDF1. While the HOTA metric takes into account temporal associations, it does not provide a tailored means to analyse the spatial associations of a dataset in a multi-camera setup. Moreover, there are differences in evaluating the detection task for points when compared to objects (point distances vs. bounding box overlap). Therefore in this work, we propose a multi-view higher order tracking metric (mvHOTA) to determine the accuracy of multi-point (multi-instance and multi-class) tracking methods, while taking into account temporal and spatial associations.mvHOTA can be interpreted as the geometric mean of detection, temporal, and spatial associations, thereby providing equal weighting to each of the factors. We demonstrate the use of this metric to evaluate the tracking performance on an endoscopic point detection dataset from a previously organised surgical data science challenge. Furthermore, we compare with other adjusted MOT metrics for this use-case, discuss the properties of mvHOTA, and show how the proposed multi-view Association and the Occlusion index (OI) facilitate analysis of methods with respect to handling of occlusions. The code is available at https://github.com/Cardio-AI/mvhota.
翻译:多点跟踪是一项具有挑战性的任务,它涉及探测现场各点,并在一组框架的顺序中跟踪这些点。计算基于检测的措施,如F度量法,在框架和框架基础上进行计算,不足以评估总体性能,因为它不能解释时间范围内的性能。主要评价指标来自多点跟踪(MOTT)方法,用以对诸如KITTI等数据集的性能进行基准评估,而最近提议的更高订单跟踪准确性(HOTA)指标,能够更清楚地描述MOTA、DetA和UNF1等计量的性能。 HOMTA 指标考虑到时间级的关联性能,但不能提供量性能分析空间组合的空间联系,此外,在对目标(点距离与绑框重叠)进行对比时,在评估点上,我们建议多点跟踪(多点和多级)的性能跟踪,同时对多点性能联盟的跟踪方法进行量性能分析,同时对时间和空间关系进行时间-空间联系进行时间-跟踪,从而对时间-时间-时间-轨道的测算进行数据分析。