Although the manipulating of the unmanned aerial manipulator (UAM) has been widely studied, vision-based UAM approaching, which is crucial to the subsequent manipulating, generally lacks effective design. The key to the visual UAM approaching lies in object tracking, while current UAM tracking typically relies on costly model-based methods. Besides, UAM approaching often confronts more severe object scale variation issues, which makes it inappropriate to directly employ state-of-the-art model-free Siamese-based methods from the object tracking field. To address the above problems, this work proposes a novel Siamese network with pairwise scale-channel attention (SiamSA) for vision-based UAM approaching. Specifically, SiamSA consists of a pairwise scale-channel attention network (PSAN) and a scale-aware anchor proposal network (SA-APN). PSAN acquires valuable scale information for feature processing, while SA-APN mainly attaches scale awareness to anchor proposing. Moreover, a new tracking benchmark for UAM approaching, namely UAMT100, is recorded with 35K frames on a flying UAM platform for evaluation. Exhaustive experiments on the benchmarks and real-world tests validate the efficiency and practicality of SiamSA with a promising speed. Both the code and UAMT100 benchmark are now available at https://github.com/vision4robotics/SiamSA.
翻译:尽管对无人驾驶航空操纵者(UAM)的操纵进行了广泛研究,但基于愿景的UAM接近,这对随后的操纵至关重要,但一般缺乏有效的设计。视觉UAM接近的关键在于物体跟踪,而目前UAM的跟踪通常依赖昂贵的模型方法。此外,UAM的接近往往面临更为严重的物体规模变异问题,这使得直接使用最先进的无型型Siamees的物体跟踪场标准不适宜。为解决上述问题,这项工作提议建立一个新型的Siamese网络,对基于愿景的UAM接近具有双向规模关注。具体地说,SiaSA是一个双向规模频道关注网络(PSAN),而目前的UAM-SAM的跟踪网络则由双向规模频道关注网络(SAPAN)和一个规模可观的固定平台(SA-APN)组成。PAN获得了用于地貌处理的有价值的规模信息,而SA-APN主要将规模意识附在定位平台上。此外,UAMMT100(UAM-100)的新的跟踪基准与35K框架一起记录了UAMUAM-UD-SA目前运行的UMAUD标准标准测试标准。