Object detection in aerial images is an important task in environmental, economic, and infrastructure-related tasks. One of the most prominent applications is the detection of vehicles, for which deep learning approaches are increasingly used. A major challenge in such approaches is the limited amount of data that arises, for example, when more specialized and rarer vehicles such as agricultural machinery or construction vehicles are to be detected. This lack of data contrasts with the enormous data hunger of deep learning methods in general and object recognition in particular. In this article, we address this issue in the context of the detection of road vehicles in aerial images. To overcome the lack of annotated data, we propose a generative approach that generates top-down images by overlaying artificial vehicles created from 2D CAD drawings on artificial or real backgrounds. Our experiments with a modified RetinaNet object detection network show that adding these images to small real-world datasets significantly improves detection performance. In cases of very limited or even no real-world images, we observe an improvement in average precision of up to 0.70 points. We address the remaining performance gap to real-world datasets by analyzing the effect of the image composition of background and objects and give insights into the importance of background.
翻译:航空图像中的天体探测是环境、经济和基础设施相关任务中的一项重要任务。最突出的应用之一是探测车辆,这些车辆越来越多地采用深层学习方法。这些方法中的一项主要挑战在于,在诸如农业机械或建筑车辆等较专门和稀有的车辆有待探测的情况下,产生的数据数量有限。这种缺乏数据的情况与一般深层学习方法,特别是物体识别方法的巨大数据饥饿情况形成对比。在本文中,我们从探测航空图像中的公路飞行器的角度来处理这个问题。为了克服附加说明的数据的缺乏,我们建议一种基因化方法,通过在人造或真实背景的2D CAD图画上铺设的人工飞行器,产生自上而下的图像。我们用一个经过修改的Retinnet物体探测网络进行的实验表明,将这些图像添加到小型真实世界数据集中可以大大改进探测性。在一般而言,甚至根本没有现实世界图像的情况下,我们观察到平均精确度提高到0.70点的情况有所改进。我们通过分析背景和物体的图像构成和洞察力的重要性,解决现实世界数据中尚存在的性差距。