Despite the recent advances of deep neural networks, object detection for adverse weather remains challenging due to the poor perception of some sensors in adverse weather. Instead of relying on one single sensor, multimodal fusion has been one promising approach to provide redundant detection information based on multiple sensors. However, most existing multimodal fusion approaches are ineffective in adjusting the focus of different sensors under varying detection environments in dynamic adverse weather conditions. Moreover, it is critical to simultaneously observe local and global information under complex weather conditions, which has been neglected in most early or late-stage multimodal fusion works. In view of these, this paper proposes a Global-Local Attention (GLA) framework to adaptively fuse the multi-modality sensing streams, i.e., camera, gated camera, and lidar data, at two fusion stages. Specifically, GLA integrates an early-stage fusion via a local attention network and a late-stage fusion via a global attention network to deal with both local and global information, which automatically allocates higher weights to the modality with better detection features at the late-stage fusion to cope with the specific weather condition adaptively. Experimental results demonstrate the superior performance of the proposed GLA compared with state-of-the-art fusion approaches under various adverse weather conditions, such as light fog, dense fog, and snow.
翻译:尽管最近出现了深层神经网络的进展,但是由于对恶劣天气中某些传感器的认知不良,对不利天气的物体探测仍然具有挑战性。多式联运融合不是依赖单一传感器,而是一种有希望的办法,在多个传感器的基础上提供多余的探测信息。然而,大多数现有的多式联运融合方法在动态恶劣天气条件下,在不同的探测环境中,在不同的探测环境中调整不同传感器的焦点方面是无效的。此外,在复杂的天气条件下同时观测当地和全球信息至关重要,在大多数早期或后期的多式联运融合工作中,这种气候条件一直受到忽视,因此,在复杂的天气条件下,对当地和全球信息进行同步观测,这在大多数早期或后期的多式联运融合工作中都受到忽视。有鉴于此,本文件建议建立一个全球-地方注意框架,以便在两个融合阶段,即摄影机、闭门摄影机和激光雷达数据,在适应特定天气状况的适应性融合阶段,即摄影机、闭门摄影机和激光雷达数据,在两个融合阶段,通过当地关注网络,通过全球关注网络,对当地和全球信息进行早期和末阶段的融合,自动将较高重量分配给在较佳的探测特征,以适应特定气候条件的状态下,实验性结果显示,在高度温度下,在高度温度下,在高度温度下,在高度下,在高度温度下进行。