The greatest challenge facing visual object tracking is the simultaneous requirements on robustness and discrimination power. In this paper, we propose a SiamFC-based tracker, named SPM-Tracker, to tackle this challenge. The basic idea is to address the two requirements in two separate matching stages. Robustness is strengthened in the coarse matching (CM) stage through generalized training while discrimination power is enhanced in the fine matching (FM) stage through a distance learning network. The two stages are connected in series as the input proposals of the FM stage are generated by the CM stage. They are also connected in parallel as the matching scores and box location refinements are fused to generate the final results. This innovative series-parallel structure takes advantage of both stages and results in superior performance. The proposed SPM-Tracker, running at 120fps on GPU, achieves an AUC of 0.687 on OTB-100 and an EAO of 0.434 on VOT-16, exceeding other real-time trackers by a notable margin.
翻译:视觉物体跟踪面临的最大挑战是同时要求稳健性和歧视力量。在本文中,我们提议用一个以SiamFC为基础的追踪器,名为SPM-Tracker,来应对这一挑战。基本的想法是分两个不同的匹配阶段解决这两项要求。通过普遍培训,在粗糙的匹配(CM)阶段得到加强,而通过远程学习网络在精细匹配(FM)阶段增强歧视力量。两个阶段是系列连接的,因为调频阶段的投入建议是由CM阶段产生的。它们也同时连接,因为匹配得分和箱位置的精细化结合,以产生最终结果。这个创新的系列平行结构利用了两个阶段的优势和优异性。拟议的SPM-Tracer,在GPU上运行120英尺,在OT-100上实现了0.687的AUC,在VOT-16上实现了0.434的EAO,在显著的幅度上超过了其他实时跟踪器。