The feature learning methods based on convolutional neural network (CNN) have successfully produced tremendous achievements in image classification tasks. However, the inherent noise and some other factors may weaken the effectiveness of the convolutional feature statistics. In this paper, we investigate Discrete Wavelet Transform (DWT) in the frequency domain and design a new Wavelet-Attention (WA) block to only implement attention in the high-frequency domain. Based on this, we propose a Wavelet-Attention convolutional neural network (WA-CNN) for image classification. Specifically, WA-CNN decomposes the feature maps into low-frequency and high-frequency components for storing the structures of the basic objects, as well as the detailed information and noise, respectively. Then, the WA block is leveraged to capture the detailed information in the high-frequency domain with different attention factors but reserves the basic object structures in the low-frequency domain. Experimental results on CIFAR-10 and CIFAR-100 datasets show that our proposed WA-CNN achieves significant improvements in classification accuracy compared to other related networks. Specifically, based on MobileNetV2 backbones, WA-CNN achieves 1.26% Top-1 accuracy improvement on the CIFAR-10 benchmark and 1.54% Top-1 accuracy improvement on the CIFAR-100 benchmark.
翻译:以进化神经网络为基础的特征学习方法成功地在图像分类任务方面取得了巨大的成就,然而,内在的噪音和其他一些因素可能会削弱进化特征统计的有效性。在本文件中,我们调查频率域中的分解波列变换(DWT),并设计一个新的波列-惯性(WA)块,仅能在高频域引起注意。在此基础上,我们提议建立一个波列-惯性动态神经网络(WA-CNN),用于图像分类。具体地说,WA-CNN将特征图分解成低频和高频组成部分,用于储存基本物体的结构以及详细的信息和噪音。然后,WA区被利用,以不同关注因素捕捉高频域的详细信息,但保留低频域的基本目标结构。CFAR-10和CIFAR-100数据集的实验结果显示,我们提议的WA-CN比其他相关网络的分类准确度显著提高。具体地说,基于SOPRO-FAR-1基准1和IMFAR-1的改进率。