To address the challenges in UAV object detection, such as complex backgrounds, severe occlusion, dense small objects, and varying lighting conditions,this paper proposes PT-DETR based on RT-DETR, a novel detection algorithm specifically designed for small objects in UAV imagery. In the backbone network, we introduce the Partially-Aware Detail Focus (PADF) Module to enhance feature extraction for small objects. Additionally,we design the Median-Frequency Feature Fusion (MFFF) module,which effectively improves the model's ability to capture small-object details and contextual information. Furthermore,we incorporate Focaler-SIoU to strengthen the model's bounding box matching capability and increase its sensitivity to small-object features, thereby further enhancing detection accuracy and robustness. Compared with RT-DETR, our PT-DETR achieves mAP improvements of 1.6% and 1.7% on the VisDrone2019 dataset with lower computational complexity and fewer parameters, demonstrating its robustness and feasibility for small-object detection tasks.
翻译:针对无人机目标检测中背景复杂、遮挡严重、小目标密集以及光照条件多变等挑战,本文基于RT-DETR提出PT-DETR,一种专为无人机影像中小目标设计的新型检测算法。在骨干网络中,我们引入了局部感知细节聚焦模块以增强对小目标的特征提取能力。此外,我们设计了中频特征融合模块,有效提升了模型捕捉小目标细节与上下文信息的能力。进一步地,我们采用Focaler-SIoU损失函数来强化模型的边界框匹配能力,并增强其对小目标特征的敏感性,从而进一步提升检测精度与鲁棒性。与RT-DETR相比,我们的PT-DETR在VisDrone2019数据集上实现了mAP分别提升1.6%和1.7%,同时具有更低的计算复杂度和更少的参数量,证明了其在小目标检测任务中的鲁棒性与可行性。