Echo State Networks (ESNs) are a special type of recurrent neural networks (RNNs), in which the input and recurrent connections are traditionally generated randomly, and only the output weights are trained. Despite the recent success of ESNs in various tasks of audio, image and radar recognition, we postulate that a purely random initialization is not the ideal way of initializing ESNs. The aim of this work is to propose an unsupervised initialization of the input connections using the K-Means algorithm on the training data. We show that this initialization performs equivalently or superior than a randomly initialized ESN whilst needing significantly less reservoir neurons (2000 vs. 4000 for spoken digit recognition, and 300 vs. 8000 neurons for f0 extraction) and thus reducing the amount of training time. Furthermore, we discuss that this approach provides the opportunity to estimate the suitable size of the reservoir based on the prior knowledge about the data.
翻译:Echo State Networks(ESNS)是一种特殊的经常性神经网络(RNNS),其输入和经常性连接传统上是随机生成的,只有输出重量才经过培训。尽管ESNS最近在音频、图像和雷达识别等各种任务中取得了成功,但我们假设,纯粹随机初始化并不是启动 ESNs 的理想方式。这项工作的目的是利用K-Means 算法在培训数据中提议一个未经监督的初始化输入连接。我们显示,这种初始化功能与随机初始启动的ESN具有同等或优越性,同时需要的储量神经元大大减少(2000年对4000个口述识别,300个神经元对8000个F0提取),从而缩短了培训时间。此外,我们讨论这一方法为根据先前对数据的了解估计储量的适当规模提供了机会。