Detecting objects in aerial images is challenging because they are typically composed of crowded small objects distributed non-uniformly over high-resolution images. Density cropping is a widely used method to improve this small object detection where the crowded small object regions are extracted and processed in high resolution. However, this is typically accomplished by adding other learnable components, thus complicating the training and inference over a standard detection process. In this paper, we propose an efficient Cascaded Zoom-in (CZ) detector that re-purposes the detector itself for density-guided training and inference. During training, density crops are located, labeled as a new class, and employed to augment the training dataset. During inference, the density crops are first detected along with the base class objects, and then input for a second stage of inference. This approach is easily integrated into any detector, and creates no significant change in the standard detection process, like the uniform cropping approach popular in aerial image detection. Experimental results on the aerial images of the challenging VisDrone and DOTA datasets verify the benefits of the proposed approach. The proposed CZ detector also provides state-of-the-art results over uniform cropping and other density cropping methods on the VisDrone dataset, increasing the detection mAP of small objects by more than 3 points.
翻译:在空中图像中检测物体具有挑战性,因为它们通常由拥挤的小物体组成,分布不统一,不以高分辨率图像为主; 密度裁剪是一种广泛使用的方法,用来改进这种小物体探测,因为拥挤的小物体区域是高分辨率提取和处理的; 然而,这通常是通过增加其他可学习的组成部分来实现的,从而使得标准探测过程的培训和推断复杂化。 在本文件中,我们建议建立一个高效的连锁缩放(CZ)探测器,该探测器可以重新利用探测器本身进行密度导导导培训和推断。 在培训期间,密度作物被定位,标记为一个新的类别,并用来扩大培训数据集。 在推断期间,密度作物首先与基级物体一起被检测,然后输入第二阶段的推断。 这种方法很容易融入任何探测器,并且不会对标准探测过程产生重大变化,像在空中图像探测中流行的统一裁剪法一样,对具有挑战性的Visdrone和DOTA数据装置本身进行实验的结果, 用于扩大培训数据集。 在推断期间,密度作物密度测算方法首先与基级物体一起检测,然后输入其他作物测算方法。</s>