This paper proposes a novel method for online Multi-Object Tracking (MOT) using Graph Convolutional Neural Network (GCNN) based feature extraction and end-to-end feature matching for object association. The Graph based approach incorporates both appearance and geometry of objects at past frames as well as the current frame into the task of feature learning. This new paradigm enables the network to leverage the "context" information of the geometry of objects and allows us to model the interactions among the features of multiple objects. Another central innovation of our proposed framework is the use of the Sinkhorn algorithm for end-to-end learning of the associations among objects during model training. The network is trained to predict object associations by taking into account constraints specific to the MOT task. Experimental results demonstrate the efficacy of the proposed approach in achieving top performance on the MOT '15, '16, '17 and '20 Challenges among state-of-the-art online approaches. The code is available at https://github.com/IPapakis/GCNNMatch.
翻译:本文提出一种新型方法,用于在线多物体跟踪(MOT),使用基于图形进化神经网络(GCNNN)的特征提取和终端到终端功能对目标关联进行匹配。基于图形的方法将过去框架的物体外观和几何以及当前框架纳入特征学习任务。这一新模式使网络能够利用物体几何的“翻版”信息,并使我们能够模拟多个物体特征之间的相互作用。我们拟议框架的另一个核心创新是使用Sinkhorn算法,在示范培训期间对对象之间的关联进行端到端学习。网络接受培训,以预测对象关联,同时考虑到MOT任务特有的制约因素。实验结果显示,拟议方法在MOT'15,'16,'17 和'20 最新在线方法中实现顶级业绩方面的效力。该代码可在https://github.com/IPapakis/GCNNMatch中查阅。