Wavelet neural networks (WNN) have been applied in many fields to solve regression as well as classification problems. After the advent of big data, as data gets generated at a brisk pace, it is imperative to analyze it as soon as it is generated owing to the fact that the nature of the data may change dramatically in short time intervals. This is necessitated by the fact that big data is all pervasive and throws computational challenges for data scientists. Therefore, in this paper, we built an efficient Scalable, Parallelized Wavelet Neural Network (SPWNN) which employs the parallel stochastic gradient algorithm (SGD) algorithm. SPWNN is designed and developed under both static and streaming environments in the horizontal parallelization framework. SPWNN is implemented by using Morlet and Gaussian functions as activation functions. This study is conducted on big datasets like gas sensor data which has more than 4 million samples and medical research data which has more than 10,000 features, which are high dimensional in nature. The experimental analysis indicates that in the static environment, SPWNN with Morlet activation function outperformed SPWNN with Gaussian on the classification datasets. However, in the case of regression, the opposite was observed. In contrast, in the streaming environment i.e., Gaussian outperformed Morlet on the classification and Morlet outperformed Gaussian on the regression datasets. Overall, the proposed SPWNN architecture achieved a speedup of 1.32-1.40.
翻译:在许多领域应用了波形神经网络(WNN)来解决回归和分类问题。在大数据出现后,随着数据以快速速度生成,一旦数据性质在短时间间隔内发生重大变化,就必须在数据生成时立即分析数据。因为大数据十分普遍,给数据科学家带来了计算挑战。因此,在本文件中,我们建立了一个高效的可缩放、平行的波形神经网络(SPWNN),利用了平行的随机梯度算法。SPWNN在横向平行化框架内的静态和流式环境中设计和开发。SPWNN是使用Morlet和高斯函数作为激活功能来执行的。这项研究是在大数据集上进行的,如400多万个样品和医学研究数据,这些数据有10 000多个特性,这些特性是高度的。实验分析显示,在静态环境中,SPWNNNN与M- 1 级梯值启动功能超过SWNNN,在横向平行的 iWNNNB 和高压式数据结构上,在已观察到的 Gaus 上进行了对比。