In this paper, we propose a highly practical fully online multi-object tracking and segmentation (MOTS) method that uses instance segmentation results as an input. The proposed method is based on the Gaussian mixture probability hypothesis density (GMPHD) filter, a hierarchical data association (HDA), and a mask-based affinity fusion (MAF) model to achieve high-performance online tracking. The HDA consists of two associations: segment-to-track and track-to-track associations. One affinity, for position and motion, is computed by using the GMPHD filter, and the other affinity, for appearance is computed by using the responses from a single object tracker such as a kernalized correlation filter. These two affinities are simply fused by using a score-level fusion method such as min-max normalization referred to as MAF. In addition, to reduce the number of false positive segments, we adopt mask IoU-based merging (mask merging). The proposed MOTS framework with the key modules: HDA, MAF, and mask merging, is easily extensible to simultaneously track multiple types of objects with CPU only execution in parallel processing. In addition, the developed framework only requires simple parameter tuning unlike many existing MOTS methods that need intensive hyperparameter optimization. In the experiments on the two popular MOTS datasets, the key modules show some improvements. For instance, ID-switch decreases by more than half compared to a baseline method in the training sets. In conclusion, our tracker achieves state-of-the-art MOTS performance in the test sets.
翻译:在本文中,我们提出了一个非常实用的、完全在线的多点跟踪和分解(MOTS)方法,该方法使用例分解结果作为输入。拟议方法基于高萨混合概率假设密度过滤器(GMPHD)过滤器、一个等级数据协会(HDA)和一个基于掩码的亲和聚合模型(MAF),以实现高性能在线跟踪。HDA由两个协会组成:分段对轨道和轨对轨联系。对于位置和运动而言,一个相似性是通过使用GMPHD过滤器来计算的,其他接近性,因为外观是通过使用一个单一对象追踪器(如内嵌化相关过滤器)的响应来计算的。这两种亲近性只是通过使用一个分数级混合组合方法(如MAFAF)来连接。此外,为了减少假正数部分的数量,我们采用了基于 IOU 的掩码合并(mask 合并) 。拟议的MOTS框架与关键模块:HDA、MAF和掩体合并,比较半级跟踪器的反应是简单化的半级标准, 需要同时运行多级的多级测试模型。