Multiple object tracking (MOT) is a task in computer vision that aims to detect the position of various objects in videos and to associate them to a unique identity. We propose an approach based on Constraint Programming (CP) whose goal is to be grafted to any existing tracker in order to improve its object association results. We developed a modular algorithm divided into three independent phases. The first phase consists in recovering the tracklets provided by a base tracker and to cut them at the places where uncertain associations are spotted, for example, when tracklets overlap, which may cause identity switches. In the second phase, we associate the previously constructed tracklets using a Belief Propagation Constraint Programming algorithm, where we propose various constraints that assign scores to each of the tracklets based on multiple characteristics, such as their dynamics or the distance between them in time and space. Finally, the third phase is a rudimentary interpolation model to fill in the remaining holes in the trajectories we built. Experiments show that our model leads to improvements in the results for all three of the state-of-the-art trackers on which we tested it (3 to 4 points gained on HOTA and IDF1).
翻译:多重对象跟踪(MOT)是计算机视野中的一项任务,目的是检测视频中各种对象的位置,并将其与独特身份联系起来。我们建议基于限制程序(CP)的方法,其目标是与任何现有跟踪器挂钩,以改善其目标关联结果。我们开发了一个模块式算法,分为三个独立阶段。第一阶段是恢复由基准跟踪器提供的跟踪器提供的跟踪器,并在发现存在不确定关联的地方切除这些跟踪器,例如,在轨迹重叠时,可能导致身份开关。在第二阶段,我们使用信仰促进控制程序算法将以前建造的跟踪器连接在一起,我们提出各种限制,根据多种特征,例如其动态或时间和空间之间的距离,为每个跟踪器分配分数。最后,第三阶段是一个基本的内插模型,以填补我们建造的轨迹中剩余的漏洞。实验表明,我们的模型导致改进了我们测试它的所有三种状态跟踪器的结果(3至4个点在HORATA和UNFIF1上获得的)。