Recently, the Siamese-based method has stood out from multitudinous tracking methods owing to its state-of-the-art (SOTA) performance. Nevertheless, due to various special challenges in UAV tracking, \textit{e.g.}, severe occlusion, and fast motion, most existing Siamese-based trackers hardly combine superior performance with high efficiency. To this concern, in this paper, a novel attentional Siamese tracker (SiamAPN++) is proposed for real-time UAV tracking. By virtue of the attention mechanism, the attentional aggregation network (AAN) is conducted with self-AAN and cross-AAN, raising the expression ability of features eventually. The former AAN aggregates and models the self-semantic interdependencies of the single feature map via spatial and channel dimensions. The latter aims to aggregate the cross-interdependencies of different semantic features including the location information of anchors. In addition, the dual features version of the anchor proposal network is proposed to raise the robustness of proposing anchors, increasing the perception ability to objects with various scales. Experiments on two well-known authoritative benchmarks are conducted, where SiamAPN++ outperforms its baseline SiamAPN and other SOTA trackers. Besides, real-world tests onboard a typical embedded platform demonstrate that SiamAPN++ achieves promising tracking results with real-time speed.
翻译:最近,基于暹罗的追踪方法因其最先进的(SOTA)性能而与多层次追踪方法脱节,然而,由于在UAV跟踪、\textit{例如}、严重排斥和快速运动方面的各种特殊挑战,大多数现有基于暹罗的追踪器很难将优异性能与高效率结合起来。为此,本文件提议实时跟踪SiamAPN++(SiamAPN++)是一个全新的关注Siames跟踪器。此外,根据关注机制,关注聚合网络(AN)以自我-AAN和跨AAN(AAN)进行,最终提高了特征的表达能力。前AAN的汇总和模型通过空间和频道层面将单一特征图的自我调节相互依存性与高效率结合起来。后者的目的是将不同语系特征的交叉相互依存关系(包括锚定点位置信息)汇总在一起。此外,主控建议网络的双重特征版本是提高提出锚定键的力度,提高Si-AAN(AAN)固定速度和跨A(Si-NBA)的常规跟踪能力,而Si-BA(S-BA-BA)比A(S-BA-BAS-BA)更像)的模型是不同规模的实地测试。