Small object detection presents a significant challenge in computer vision and object detection. The performance of small object detectors is often compromised by a lack of pixels and less significant features. This issue stems from information misalignment caused by variations in feature scale and information loss during feature processing. In response to this challenge, this paper proposes a novel the Multi to Single Module (M2S), which enhances a specific layer through improving feature extraction and refining features. Specifically, M2S includes the proposed Cross-scale Aggregation Module (CAM) and explored Dual Relationship Module (DRM) to improve information extraction capabilities and feature refinement effects. Moreover, this paper enhances the accuracy of small object detection by utilizing M2S to generate an additional detection head. The effectiveness of the proposed method is evaluated on two datasets, VisDrone2021-DET and SeaDronesSeeV2. The experimental results demonstrate its improved performance compared with existing methods. Compared to the baseline model (YOLOv5s), M2S improves the accuracy by about 1.1\% on the VisDrone2021-DET testing dataset and 15.68\% on the SeaDronesSeeV2 validation set.
翻译:小物体检测在计算机视觉和物体检测中表示出极大的挑战性。由于像素不足和较少的显著特征,小物体探测器的性能常常受到影响。这个问题源于特征尺度的变化和特征处理过程中的信息丢失导致的信息不对齐。为了应对这一挑战,本文提出了一种新型的多至单模块(M2S),通过提高特征提取和特征优化来增强特定层的性能。具体来说,M2S包括所提出的跨尺度聚合模块(CAM)和探索的双重关系模块(DRM),以提高信息提取能力和特征优化效果。此外,本文通过利用M2S来生成额外的检测头来提高小物体检测的准确性。我们在两个数据集VisDrone2021-DET和SeaDronesSeeV2上评估了所提出方法的有效性。实验结果表明,与现有方法相比,它具有更好的性能。与基线模型(YOLOv5s)相比,M2S在VisDrone2021-DET测试数据集上将准确度提高了约1.1%,在SeaDronesSeeV2验证集上提高了15.68%。