Recently, the Siamese-based method has stood out from multitudinous tracking methods owing to its state-of-the-art (SOTA) performance. Nevertheless, due to various special challenges in UAV tracking, \textit{e.g.}, severe occlusion and fast motion, most existing Siamese-based trackers hardly combine superior performance with high efficiency. To this concern, in this paper, a novel attentional Siamese tracker (SiamAPN++) is proposed for real-time UAV tracking. By virtue of the attention mechanism, we conduct a special attentional aggregation network (AAN) consisting of self-AAN and cross-AAN for raising the representation ability of features eventually. The former AAN aggregates and models the self-semantic interdependencies of the single feature map via spatial and channel dimensions. The latter aims to aggregate the cross-interdependencies of two different semantic features including the location information of anchors. In addition, the anchor proposal network based on dual features is proposed to raise its robustness of tracking objects with various scales. Experiments on two well-known authoritative benchmarks are conducted, where SiamAPN++ outperforms its baseline SiamAPN and other SOTA trackers. Besides, real-world tests onboard a typical embedded platform demonstrate that SiamAPN++ achieves promising tracking results with real-time speed.
翻译:最近,基于暹罗的追踪方法因其最先进的(SOTA)性能而与多层次追踪方法脱节,然而,由于在UAV跟踪、\textit{例如}、严重隔离和快速运动方面的各种特殊挑战,大多数现有基于暹罗的追踪器几乎没有将优异性能与高效率结合起来,为此,本文件提议为实时UAV跟踪提供一个新的关注的Siaamese追踪器(SiamAPN+++),根据关注机制,我们建立了一个由自我AN和跨AAN组成的特别关注聚合网络(AN),以最终提高地貌代表能力。前AAN的汇总和模型通过空间和频道层面将单一地貌图的自发性相互依存性与自高效率结合起来。后者旨在将两种不同语系特征的交叉性关系(SiaamAPN++)汇总在一起,包括锚定地点信息。此外,还提议基于双重特征的定位建议网络,以提高其跟踪目标的稳健性,用各种标准的Si-AAN和跨A(Siampal-A)轨道进行真正的Siam-A测试。