Echo state networks (ESNs) are a powerful form of reservoir computing that only require training of linear output weights whilst the internal reservoir is formed of fixed randomly connected neurons. With a correctly scaled connectivity matrix, the neurons' activity exhibits the echo-state property and responds to the input dynamics with certain timescales. Tuning the timescales of the network can be necessary for treating certain tasks, and some environments require multiple timescales for an efficient representation. Here we explore the timescales in hierarchical ESNs, where the reservoir is partitioned into two smaller linked reservoirs with distinct properties. Over three different tasks (NARMA10, a reconstruction task in a volatile environment, and psMNIST), we show that by selecting the hyper-parameters of each partition such that they focus on different timescales, we achieve a significant performance improvement over a single ESN. Through a linear analysis, and under the assumption that the timescales of the first partition are much shorter than the second's (typically corresponding to optimal operating conditions), we interpret the feedforward coupling of the partitions in terms of an effective representation of the input signal, provided by the first partition to the second, whereby the instantaneous input signal is expanded into a weighted combination of its time derivatives. Furthermore, we propose a data-driven approach to optimise the hyper-parameters through a gradient descent optimisation method that is an online approximation of backpropagation through time. We demonstrate the application of the online learning rule across all the tasks considered.
翻译:快速回声状态网络(ESNs)是一种强大的储油层计算形式,只需要培训线性输出重量,而内部储油层则由固定随机连接的神经元组成。通过一个正确缩放的连接矩阵,神经元的活动显示回声状态属性,并用某些时间尺度对输入动态作出反应。为处理某些任务,有必要对网络的时间尺度进行调整,而有些环境需要多个时间尺度才能有效代表。在这里,我们探索ESN等级的时间尺度,即储油层被分割成两个有不同特性的小型连接储油层。在三种不同任务(NARMA10, 不稳定环境中的重建任务,以及 PSSMNIST)中,我们通过选择每个分区的超参数属性属性,以不同的时间尺度对输入动态动态进行响应。我们通过线性分析,假设第一个分区的时间尺度比第二个更短得多(通常与最佳操作条件相对应 ),我们解读了第二个间隔的进料组合。我们理解了第二个间隔的进料组合,从一个有效的线性指数模型到一个方向,我们提供了一个方向的螺旋路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路由。我们通过一个通过一个由路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路路