BA-Net:深海革命神经网络的桥梁关注 (BA-Net: Bridge Attention for Deep Convolutional Neural Networks)

In recent years, channel attention mechanism is widely investigated for its great potential in improving the performance of deep convolutional neural networks (CNNs). However, in most existing methods, only the output of the adjacent convolution layer is fed to the attention layer for calculating the channel weights. Information from other convolution layers is ignored. With these observations, a simple strategy, named Bridge Attention Net (BA-Net), is proposed for better channel attention mechanisms. The main idea of this design is to bridge the outputs of the previous convolution layers through skip connections for channel weights generation. BA-Net can not only provide richer features to calculate channel weight when feedforward, but also multiply paths of parameters updating when backforward. Comprehensive evaluation demonstrates that the proposed approach achieves state-of-the-art performance compared with the existing methods in regards to accuracy and speed. Bridge Attention provides a fresh perspective on the design of neural network architectures and shows great potential in improving the performance of the existing channel attention mechanisms. The code is available at \url{https://github.com/zhaoy376/Attention-mechanism

翻译：近年来,对频道关注机制进行了广泛调查,以了解其在改善深层神经神经网络(CNNs)性能方面的巨大潜力;然而,在大多数现有方法中,只有相邻的卷变层的输出才被注入关注层,以计算频道重量;其他卷变层的信息被置之不理;通过这些观察,提出了称为BA-Net(BA-Net)的简单战略,以更好地关注管道机制;这一设计的主要想法是通过跳过频道重量生成的连接来弥合先前的卷变层的产出。BA-Net不仅能够提供更丰富的功能,以计算向前的频道重量,还可以在向后增加更新参数的路径。全面评估表明,拟议的方法在准确性和速度方面,与现有方法相比,取得了最先进的性能。“桥梁关注”为设计神经网络结构提供了新的视角,并展示了改进现有频道关注机制绩效的巨大潜力。该代码可在\url{https://github.com/zhaoy376/Attemini-mechanis)查阅。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

【KDD2020】更深的图神经网络，Towards Deeper Graph Neural Networks

专知会员服务

90+阅读 · 2020年7月22日

【IJCAJ 2020】多通道神经网络 Multi-Channel Graph Neural Networks

专知会员服务

26+阅读 · 2020年7月19日

【清华大学】图随机神经网络，Graph Random Neural Networks

专知会员服务

156+阅读 · 2020年5月26日

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

专知会员服务

35+阅读 · 2020年4月15日