Tracking multiple athletes in sports videos is a very challenging Multi-Object Tracking (MOT) task, since athletes often have the same appearance and are intimately covered with each other, making a common occlusion problem becomes an abhorrent duplicate detection. In this paper, the duplicate detection is newly and precisely defined as occlusion misreporting on the same athlete by multiple detection boxes in one frame. To address this problem, we meticulously design a novel transformer-based Duplicate Detection Decontaminator (D$^3$) for training, and a specific algorithm Rally-Hungarian (RH) for matching. Once duplicate detection occurs, D$^3$ immediately modifies the procedure by generating enhanced boxes losses. RH, triggered by the team sports substitution rules, is exceedingly suitable for sports videos. Moreover, to complement the tracking dataset that without shot changes, we release a new dataset based on sports video named RallyTrack. Extensive experiments on RallyTrack show that combining D$^3$ and RH can dramatically improve the tracking performance with 9.2 in MOTA and 4.5 in HOTA. Meanwhile, experiments on MOT-series and DanceTrack discover that D$^3$ can accelerate convergence during training, especially save up to 80 percent of the original training time on MOT17. Finally, our model, which is trained only with volleyball videos, can be applied directly to basketball and soccer videos for MAT, which shows priority of our method. Our dataset is available at https://github.com/heruihr/rallytrack.
翻译:在体育录像中追踪多个运动员是一项非常具有挑战性的多球跟踪任务,因为运动员经常有相同的外观,而且相互相互密切覆盖,共同的封闭问题成为令人憎恶的重复检测。在本文中,重复的检测被明确定义为通过多个检测框在一个框内对同一个运动员进行隐蔽误报。为了解决这个问题,我们精心设计了一个基于变压器的新变压器的重复检测脱色器(3美元)用于培训,并设计了一个用于匹配的具体的Rally-Hungarian(RH)算法。一旦发现重复,D$3美元就会立即通过产生强化的箱损失来改变程序。在团队体育替代规则下触发的RH,非常适合于体育视频。此外,为了补充不发生变化的跟踪数据集,我们发布了一个基于体育视频名为RallyTrack的新数据集。 在RallyTrack上进行的广泛实验显示,只有D3美元和RHRH的组合才能大大改进跟踪工作,在MOTA和HOTA$4.5中进行9.2和4.5的跟踪。同时,在MOT-ROTA中进行实验,在M-ROT-CS-CS-CR可以直接发现我们的原始培训过程中,在80-C-ROD-C-C-C-LVD-LVAL-T-T-T-T-S-T-T-T-T-S-S-T-T-T-S-LV上,在原始培训中,在80-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-T-