As airborne vehicles become more autonomous and ubiquitous, it is vital that they be able to detect objects in their surroundings. This paper addresses the problem of detecting drones from other flying drones. The erratic movement of the source and target drones, along with their small size, arbitrary shape, large intensity variations, and occlusion, makes this problem quite challenging. In this scenario, region-proposal based methods cannot capture sufficient discriminative foreground-background information. Moreover, because of the extremely small size and complex motion of the source and target drones, feature-aggregation based methods do not perform well. To handle this, instead of using region-proposal based methods, we propose a two-stage segmentation-based approach that employs spatio-temporal attention cues. In the first stage, given overlapping frame regions, detailed contextual information is captured over convolutional feature maps using pyramid pooling. Pixel-wise and channel-wise attention is then enforced on the feature maps to ensure accurate drone localization. In the second stage, the first-stage detections are verified and new probable drone locations are explored using motion boundaries. This is followed by tracking candidate drone detections for a few frames, forming cuboids, extracting a 3D convolutional feature map, and detecting drones within each cuboid. The proposed approach is evaluated on two publicly available drone detection datasets and outperforms several competitive baselines.
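To make the first-stage attention step concrete, the following is a minimal sketch of channel-wise and pixel-wise gating applied to a feature map. It is an illustration under stated assumptions, not the architecture evaluated in the paper: the gates here are parameter-free sigmoids of average activations, whereas a trained model would learn them, and the order of the two steps (channel first, then pixel) is assumed. The function names `channel_attention`, `pixel_attention`, and `attend` are hypothetical.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(fmap):
    """Scale each channel by a sigmoid gate of its global average.

    fmap is a nested list of shape [channels][height][width].
    Assumption: no learned weights; a real model would learn the gate.
    """
    out = []
    for channel in fmap:
        values = [v for row in channel for v in row]
        gate = sigmoid(sum(values) / len(values))
        out.append([[v * gate for v in row] for row in channel])
    return out


def pixel_attention(fmap):
    """Scale each spatial location by a sigmoid gate of its mean across channels."""
    num_channels = len(fmap)
    height, width = len(fmap[0]), len(fmap[0][0])
    gates = [
        [
            sigmoid(sum(fmap[c][i][j] for c in range(num_channels)) / num_channels)
            for j in range(width)
        ]
        for i in range(height)
    ]
    return [
        [[fmap[c][i][j] * gates[i][j] for j in range(width)] for i in range(height)]
        for c in range(num_channels)
    ]


def attend(fmap):
    # Assumed ordering: channel gating first, then pixel gating.
    return pixel_attention(channel_attention(fmap))


if __name__ == "__main__":
    # Toy 2-channel, 2x2 feature map: channel 0 is silent, channel 1 is active.
    toy = [
        [[0.0, 0.0], [0.0, 0.0]],
        [[2.0, 0.0], [0.0, 2.0]],
    ]
    for channel in attend(toy):
        print(channel)
```

In this toy input, locations with strong activations keep more of their response than silent ones, which is the intuition behind using attention to sharpen the localization of very small targets.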