Most existing trackers based on deep learning perform tracking in a holistic strategy, which aims to learn deep representations of the whole target for localizing the target. It is arduous for such methods to track targets with various appearance variations. To address this limitation, another type of methods adopts a part-based tracking strategy which divides the target into equal patches and tracks all these patches in parallel. The target state is inferred by summarizing the tracking results of these patches. A potential limitation of such trackers is that not all patches are equally informative for tracking. Some patches that are not discriminative may have adverse effects. In this paper, we propose to track the salient local parts of the target that are discriminative for tracking. In particular, we propose a fine-grained saliency mining module to capture the local saliencies. Further, we design a saliency-association modeling module to associate the captured saliencies together to learn effective correlation representations between the exemplar and the search image for state estimation. Extensive experiments on five diverse datasets demonstrate that the proposed method performs favorably against state-of-the-art trackers.
翻译:以深层学习为基础的大多数现有跟踪者在整体战略中进行跟踪,目的是了解对目标定位目标整体目标的深刻描述,对于这些方法来说,跟踪目标有各种各样的外观差异,这是十分艰巨的。为了应对这一限制,另一种方法采用基于部分的跟踪战略,将目标分割成平等的补丁,并同时跟踪所有这些补丁。通过总结这些补丁的跟踪结果推断出目标状态。这些跟踪者的潜在局限性是,并非所有补丁都同样为跟踪提供信息。一些非歧视性的补丁可能具有不利影响。在本文件中,我们提议跟踪目标中具有歧视性的突出地方部分。我们特别建议建立一个精细的显著采矿模块,以捕捉当地的显著特征。此外,我们设计了一个突出的关联模型,将所捕捉的显著特征结合起来,学习用于国家估算的缩略图和搜索图之间的有效关联表述。对五种不同的数据集进行广泛的实验表明,拟议的方法对州级跟踪者有利。