UAV tracking can be widely applied in scenarios such as disaster rescue, environmental monitoring, and logistics transportation. However, existing UAV tracking methods predominantly emphasize speed and lack exploration in semantic awareness, which hinders the search region from extracting accurate localization information from the template. The limitation results in suboptimal performance under typical UAV tracking challenges such as camera motion, fast motion, and low resolution, etc. To address this issue, we propose a dynamic semantic aware correlation modeling tracking framework. The core of our framework is a Dynamic Semantic Relevance Generator, which, in combination with the correlation map from the Transformer, explore semantic relevance. The approach enhances the search region's ability to extract important information from the template, improving accuracy and robustness under the aforementioned challenges. Additionally, to enhance the tracking speed, we design a pruning method for the proposed framework. Therefore, we present multiple model variants that achieve trade-offs between speed and accuracy, enabling flexible deployment according to the available computational resources. Experimental results validate the effectiveness of our method, achieving competitive performance on multiple UAV tracking datasets. The code is available at https://github.com/zxyyxzz/DSATrack.
翻译:无人机跟踪可广泛应用于灾害救援、环境监测和物流运输等场景。然而,现有无人机跟踪方法主要强调速度,在语义感知方面缺乏探索,这导致搜索区域难以从模板中提取精确定位信息。该局限性使得方法在相机运动、快速运动和低分辨率等典型无人机跟踪挑战下表现欠佳。为解决此问题,我们提出了一种动态语义感知相关建模跟踪框架。该框架的核心是一个动态语义相关性生成器,其结合Transformer产生的相关图来探索语义相关性。该方法增强了搜索区域从模板中提取重要信息的能力,从而提高了在上述挑战下的准确性和鲁棒性。此外,为提升跟踪速度,我们为该框架设计了一种剪枝方法。因此,我们提出了多种模型变体,实现了速度与精度之间的权衡,能够根据可用计算资源进行灵活部署。实验结果验证了本方法的有效性,在多个无人机跟踪数据集上取得了具有竞争力的性能。代码可在 https://github.com/zxyyxzz/DSATrack 获取。