Convolutional neural network (CNN) achieves impressive success in the field of computer vision during the past few decades. As the core of CNNs, image convolution operation helps CNNs to achieve good performance on image-related tasks. However, image convolution is hard to be implemented and parallelized. In this paper, we propose a novel neural network model, namely CEMNet, that can be trained in frequency domain. The most important motivation of this research is that we can use the very simple element-wise multiplication operation to replace the image convolution in frequency domain based on Cross-Correlation Theorem. We further introduce Weight Fixation Mechanism to alleviate over-fitting, and analyze the working behavior of Batch Normalization, Leaky ReLU and Dropout in frequency domain to design their counterparts for CEMNet. Also, to deal with complex inputs brought by DFT, we design two branch network structure for CEMNet. Experimental results imply that CEMNet works well in frequency domain, and achieve good performance on MNIST and CIFAR-10 databases. To our knowledge, CEMNet is the first model trained in Fourier Domain that achieves more than 70\% validation accuracy on CIFAR-10 database.
翻译:在过去几十年中,在计算机视觉领域,革命神经网络(CNN)取得了令人印象深刻的成功。作为CNN的核心,图像革命行动帮助CNN在图像相关任务上取得良好业绩。然而,图像革命很难实施和平行。在本文中,我们提出一个新的神经网络模型,即CEMNet,可以在频率领域进行培训。这一研究最重要的动机是,我们可以使用非常简单的元素智能倍增操作来取代基于交叉校正理论的频率域内图像变化。我们进一步引入了重力固化机制,以缓解超称,分析Batch Algard化、Leky ReLU和频域中流出的工作行为,以设计CEMNet的对应方。此外,为了处理DFT带来的复杂投入,我们为CEMNet设计了两个分支网络结构。实验结果表明,CEMNet在频域内运作良好,并在MNIST和CIFAR-10数据库中取得良好的业绩。对于我们的知识来说,CEMNet是第一个在四号FAR数据库中经过培训的模型,其精确度超过70。