Recently, the availability of remote sensing imagery from aerial vehicles and satellites constantly improved. For an automated interpretation of such data, deep-learning-based object detectors achieve state-of-the-art performance. However, established object detectors require complete, precise, and correct bounding box annotations for training. In order to create the necessary training annotations for object detectors, imagery can be georeferenced and combined with data from other sources, such as points of interest localized by GPS sensors. Unfortunately, this combination often leads to poor object localization and missing annotations. Therefore, training object detectors with such data often results in insufficient detection performance. In this paper, we present a novel approach for training object detectors with extremely noisy and incomplete annotations. Our method is based on a teacher-student learning framework and a correction module accounting for imprecise and missing annotations. Thus, our method is easy to use and can be combined with arbitrary object detectors. We demonstrate that our approach improves standard detectors by 37.1\% $AP_{50}$ on a noisy real-world remote-sensing dataset. Furthermore, our method achieves great performance gains on two datasets with synthetic noise. Code is available at \url{https://github.com/mxbh/robust_object_detection}.
翻译:最近,从航空飞行器和卫星获得遥感图像的情况不断改善;为了对这些数据进行自动化解释,基于深学习的物体探测器取得最新性能;然而,既定的物体探测器需要完整、准确和正确的训练框说明;为了为物体探测器创造必要的培训说明,图像可以进行地理参照,并与其他来源的数据(例如全球定位系统传感器点点点)相结合;不幸的是,这种结合往往导致物体定位差和说明缺失。因此,使用这些数据的训练物体探测器往往导致探测性能不足。在本文件中,我们提出了一种新颖的方法,用极为吵闹和不完整的说明来训练物体探测器。我们的方法是以教师-学生学习框架和校正模块为基础,说明不准确和缺失的说明。因此,我们的方法很容易使用,并且可以与任意的物体探测器相结合。我们证明,我们的方法是用一种噪音噪音在现实遥感数据集上改进标准探测器37.1 $AP ⁇ 50美元。此外,我们的方法在合成噪音两个数据集上取得了很大的性益。