Attention Branch Networks (ABNs) have been shown to simultaneously provide visual explanation and improve the performance of deep convolutional neural networks (CNNs). In this work, we introduce Multi-Scale Attention Branch Networks (MSABN), which enhance the resolution of the generated attention maps, and improve the performance. We evaluate MSABN on benchmark image recognition and fine-grained recognition datasets where we observe MSABN outperforms ABN and baseline models. We also introduce a new data augmentation strategy utilizing the attention maps to incorporate human knowledge in the form of bounding box annotations of the objects of interest. We show that even with a limited number of edited samples, a significant performance gain can be achieved with this strategy.
翻译:在这项工作中,我们引入了多层次关注分支网络(MSABN),这些网络加强了所生成的注意地图的分辨率,并改进了性能;我们根据基准图像识别和微缩的识别数据集对MSABN进行了评估,我们在那里看到MISBN优于ABN和基线模型;我们还采用了新的数据增强战略,利用关注地图将人类知识纳入其中,对感兴趣的对象进行捆绑式方框说明;我们表明,即使有数量有限的经过编辑的样本,这一战略也能取得显著的绩效收益。