Taking the deep learning-based algorithms into account has become a crucial way to boost object detection performance in aerial images. While various neural network representations have been developed, previous works are still inefficient to investigate the noise-resilient performance, especially on aerial images with noise taken by the cameras with telephoto lenses, and most of the research is concentrated in the field of denoising. Of course, denoising usually requires an additional computational burden to obtain higher quality images, while noise-resilient is more of a description of the robustness of the network itself to different noises, which is an attribute of the algorithm itself. For this reason, the work will be started by analyzing the noise-resilient performance of the neural network, and then propose two hypotheses to build a noise-resilient structure. Based on these hypotheses, we compare the noise-resilient ability of the Oct-ResNet with frequency division processing and the commonly used ResNet. In addition, previous feature pyramid networks used for aerial object detection tasks are not specifically designed for the frequency division feature maps of the Oct-ResNet, and they usually lack attention to bridging the semantic gap between diverse feature maps from different depths. On the basis of this, a novel octave convolution-based semantic attention feature pyramid network (OcSaFPN) is proposed to get higher accuracy in object detection with noise. The proposed algorithm tested on three datasets demonstrates that the proposed OcSaFPN achieves a state-of-the-art detection performance with Gaussian noise or multiplicative noise. In addition, more experiments have proved that the OcSaFPN structure can be easily added to existing algorithms, and the noise-resilient ability can be effectively improved.
翻译:将深度学习的算法纳入考虑,这已成为提高航空图像中物体探测性能的关键方法。虽然已经开发了各种神经网络代表,但先前的工作仍然效率低下,无法对噪音抗御性性能进行调查,特别是摄像头用远光透透视镜拍摄的噪音摄像头摄取的空中图像,大部分研究集中在消化领域。当然,去除性能通常需要额外的计算负担,才能获得更高质量的图像,而噪音抗御能力则更能说明网络本身与不同噪音的强度,这是算法本身的属性。为此,将开始这项工作的办法是对神经网络的噪音抗振性性能进行有效分析,然后提出两个假体来建立具有噪音抗御性的结构。根据这些假设,我们比较Ocent-ResNet的噪音抗振荡性能力,通过频率分解处理和常用的ResNet来进行。此外,以前用于天体探测任务的地基级级变异能网络的频率分析性能网络并不是专门设计的。在Ocent-ResNet上,它们通常会将注意力压缩地基的频率分校程图上的测测算性能结构进行。