Object detection is one of the key target tasks of interest in the context of civil and military applications. In particular, the real-world deployment of target detection methods is pivotal in the decision-making process during military command and reconnaissance. However, current domain adaptive object detection algorithms consider adapting one domain to another similar one only within the scope of natural or autonomous driving scenes. Since military domains often deal with a mixed variety of environments, detecting objects from multiple varying target domains poses a greater challenge. Several studies for armored military target detection have made use of synthetic aperture radar (SAR) data due to its robustness to all weather, long range, and high-resolution characteristics. Nevertheless, the costs of SAR data acquisition and processing are still much higher than those of the conventional RGB camera, which is a more affordable alternative with significantly lower data processing time. Furthermore, the lack of military target detection datasets limits the use of such a low-cost approach. To mitigate these issues, we propose to generate RGB-based synthetic data using a photorealistic visual tool, Unreal Engine, for military target detection in a cross-domain setting. To this end, we conducted synthetic-to-real transfer experiments by training our synthetic dataset and validating on our web-collected real military target datasets. We benchmark the state-of-the-art domain adaptation methods distinguished by the degree of supervision on our proposed train-val dataset pair, and find that current methods using minimal hints on the image (e.g., object class) achieve a substantial improvement over unsupervised or semi-supervised DA methods. From these observations, we recognize the current challenges that remain to be overcome.
翻译:目标检测是民用和军事应用背景下关注的关键目标任务之一。特别是在军事指挥与侦察的决策过程中,目标检测方法的实际部署至关重要。然而,当前的领域自适应目标检测算法仅考虑在自然场景或自动驾驶场景范围内将一个领域适应到另一个相似领域。由于军事领域通常涉及多种混合环境,从多个变化的目标域中检测物体提出了更大的挑战。一些针对装甲军事目标检测的研究利用了合成孔径雷达(SAR)数据,因其具有全天候、远距离和高分辨率的鲁棒特性。然而,SAR数据的获取和处理成本仍远高于传统的RGB相机,后者是一种更经济实惠的替代方案,且数据处理时间显著更短。此外,军事目标检测数据集的缺乏限制了这种低成本方法的应用。为缓解这些问题,我们提出使用逼真视觉工具Unreal Engine生成基于RGB的合成数据,用于跨域环境下的军事目标检测。为此,我们通过训练合成数据集并在网络收集的真实军事目标数据集上进行验证,开展了合成到真实的迁移实验。我们在提出的训练-验证数据集对上,以监督程度为区分标准,对最先进的领域自适应方法进行了基准测试,发现当前使用图像上最小提示(例如,物体类别)的方法相较于无监督或半监督DA方法取得了显著改进。基于这些观察,我们认识到当前仍待克服的挑战。