Convolutional neural network (CNN) has achieved impressive success in computer vision during the past few decades. As the core of CNNs, the image convolution operation helps CNNs to get good performance on image-related tasks. However, the image convolution is hard to be implemented and parallelized. This paper proposes a novel neural network model, namely CEMNet, which can be trained in the frequency domain. The most important motivation of this research is that we can use the straightforward element-wise multiplication operation to replace the image convolution in the frequency domain based on the Cross-Correlation Theorem. We further introduce a Weight Fixation mechanism to alleviate the problem of over-fitting, and analyze the working behavior of Batch Normalization, Leaky ReLU, and Dropout in the frequency domain to design their counterparts for CEMNet. Also, to deal with complex inputs brought by Discrete Fourier Transform, we design a two-branches network structure for CEMNet. Experimental results imply that CEMNet achieves good performance on MNIST and CIFAR-10 databases. To the best of our knowledge, CEMNet is the first model trained in Fourier Domain that achieves more than 70\% validation accuracy on CIFAR-10 database.
翻译:在过去几十年中,计算机神经神经网络(CNN)在计算机视觉方面取得了令人印象深刻的成功。作为CNN的核心,图像变异操作帮助CNN在图像相关任务上取得良好表现。然而,图像变异很难实施,而且很难平行。本文件提议了一个新型神经网络模型,即CEMNet,可以在频率领域接受培训。这一研究最重要的动力是,我们可以使用直截了当的元素变异操作来取代基于交叉校正理论的频率领域的图像变异。作为CNN的核心,我们进一步引入了一个重力变异机制,以缓解超装问题,并分析Batch正常化、Laky ReLU和频域中辍学的工作行为,以设计CEMNet的对应方。此外,为了处理Discrete Fourier变换型带来的复杂投入,我们为CEMNet设计了两层网络结构。实验结果表明,CEMNet在MNIST和CIFAR-10数据库上取得了良好的业绩。为了最佳的准确性,CEM网络在我们的70-10号数据库中实现了最先进的验证。