Learning Calibrated-Guidance for Object Detection in Aerial Images

Object detection is one of the most fundamental yet challenging research topics in computer vision. Recently, research on this topic in aerial images has made tremendous progress. However, complex backgrounds and poor imaging quality remain obvious problems in aerial object detection. Most state-of-the-art approaches develop elaborate attention mechanisms for space-time feature calibration at arduous computational cost, while surprisingly ignoring the importance of channel-wise feature calibration. In this work, we propose a simple yet effective Calibrated-Guidance (CG) scheme to enhance channel communications in a feature transformer fashion, which adaptively determines the calibration weights for each channel based on global feature affinity correlations. Specifically, for a given set of feature maps, CG first computes the feature similarity between each channel and the remaining channels as the intermediary calibration guidance. It then re-represents each channel by aggregating all the channels, weighted together via the guidance operation. CG is a general module that can be plugged into any deep neural network; the resulting network is named CG-Net. To demonstrate its effectiveness and efficiency, extensive experiments are carried out on both oriented and horizontal object detection tasks in aerial images. Experimental results on two challenging benchmarks (DOTA and HRSC2016) demonstrate that CG-Net achieves new state-of-the-art accuracy with a fair computational overhead. The source code is available at https://github.com/WeiZongqi/CG-Net
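The two CG steps described above (channel-to-channel similarity, then a guidance-weighted aggregation of all channels) can be illustrated with a minimal sketch. This is not the released CG-Net code: the function name `calibrated_guidance`, the dot-product similarity, the softmax normalization, and the inclusion of each channel's similarity with itself are all assumptions made for illustration, and learned projections or residual connections are omitted.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(value - peak) for value in row]
    total = sum(exps)
    return [e / total for e in exps]


def calibrated_guidance(features):
    """Hypothetical channel-wise calibration sketch.

    features: list of C channels, each a flattened list of N spatial values.
    Returns a list with the same C x N shape.
    """
    num_channels = len(features)
    num_positions = len(features[0])

    # Step 1: affinity between every pair of channels.
    # Assumption: plain dot product of the flattened feature maps.
    affinity = [
        [sum(a * b for a, b in zip(features[i], features[j]))
         for j in range(num_channels)]
        for i in range(num_channels)
    ]

    # Assumption: softmax turns each row of affinities into guidance
    # weights that sum to 1.
    guidance = [softmax(row) for row in affinity]

    # Step 2: re-represent each channel as the guidance-weighted sum
    # of all channels.
    return [
        [sum(guidance[i][j] * features[j][k] for j in range(num_channels))
         for k in range(num_positions)]
        for i in range(num_channels)
    ]
```

In this sketch the guidance matrix has size C x C, so its cost grows with the number of channels rather than with the number of spatial positions, which is one plausible reading of why a channel-wise calibration can stay cheap compared with spatial attention.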
