Driven by recent advances in object detection with deep neural networks, the tracking-by-detection paradigm has gained increasing prevalence in the research community of multi-object tracking (MOT). It has long been known that appearance information plays an essential role in the detection-to-track association, which lies at the core of the tracking-by-detection paradigm. While most existing works consider the appearance distances between the detections and the tracks, they ignore the statistical information implied by the historical appearance distance records in the tracks, which can be particularly useful when a detection has similar distances with two or more tracks. In this work, we propose a hybrid track association (HTA) algorithm that models the historical appearance distances of a track with an incremental Gaussian mixture model (IGMM) and incorporates the derived statistical information into the calculation of the detection-to-track association cost. Experimental results on three MOT benchmarks confirm that HTA effectively improves the target identification performance with a small compromise to the tracking speed. Additionally, compared to many state-of-the-art trackers, the DeepSORT tracker equipped with HTA achieves better or comparable performance in terms of the balance of tracking quality and speed.
翻译:近些年来,在与深神经网络的物体探测方面出现了进步,因此,跟踪和检测模式在多物体跟踪研究界越来越普遍。人们早已知道,外观信息在探测到轨联系方面发挥着至关重要的作用,这是跟踪到轨联系的核心。虽然大多数现有工作都考虑到探测到轨道之间的外观距离,但它们忽视了轨道上历史外观记录所隐含的统计资料,如果探测距离与两条或多条轨道相近,这种统计资料可能特别有用。在这项工作中,我们建议采用混合轨联系算法,用递增高斯混合模型(IGMM)模拟轨道的历史外观距离,并将衍生的统计资料纳入探测到轨联系成本的计算中。MOT的三个基准的实验结果证实,HTA有效地改进了目标识别工作,对跟踪速度作了小的妥协。此外,与许多州级跟踪者相比,配备HTA的深SOR跟踪器在跟踪质量和速度方面达到更好或可比的业绩。