In this paper, we propose a highly practical fully online multi-object tracking and segmentation (MOTS) method that uses instance segmentation results as an input in video. The proposed method exploits the Gaussian mixture probability hypothesis density (GMPHD) filter for online approach which is extended with a hierarchical data association (HDA) and a simple affinity fusion (SAF) model. HDA consists of segment-to-track and track-to-track associations. To build the SAF model, an affinity is computed by using the GMPHD filter that is represented by the Gaussian mixture models with position and motion mean vectors, and another affinity for appearance is computed by using the responses from single object tracker such as the kernalized correlation filters. These two affinities are simply fused by using a score-level fusion method such as Min-max normalization. In addition, to reduce false positive segments, we adopt Mask IoU based merging. In experiments, those key modules, i.e., HDA, SAF, and Mask merging show incremental improvements. For instance, ID-switch decreases by half compared to baseline method. In conclusion, our tracker achieves state-of-the-art level MOTS performance.
翻译:在本文中,我们提出了一个非常实用的、完全在线的多对象跟踪和分解(MOTS)方法,将例分解结果作为视频输入。拟议方法利用高斯混合概率假设密度(GMPHD)过滤器进行在线处理,该过滤器由等级数据协会(HDA)和简单的亲和聚合(SAF)模式扩展。HDA由分段对轨道和轨对轨联系组成。为了建立苏丹武装部队模型,我们通过使用Gaussian混合模型代表的GMPHD过滤器来计算一种亲和性。在实验中,由高斯混合模型代表的具有位置和运动平均矢量的GPHD、另一个外观的亲近性是通过使用单对象追踪器(如内层相关过滤器)的反应来计算的。这两个相近性只是通过使用分级混合法(如Min-max正常化)来结合。此外,我们采用Mask IoU为基础的组合。在实验中,这些关键模块,即HDADA、SA和Mask等组合显示渐进式的改进。例如,ID-DTS-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx