Background/introduction: Cross-Validation (CV) is still uncommon in time series modeling. Echo State Networks (ESNs), as a prime example of Reservoir Computing (RC) models, are known for their fast and precise one-shot learning, that often benefit from good hyper-parameter tuning. This makes them ideal to change the status quo. Methods: We discuss CV of time series for predicting a concrete time interval of interest, suggest several schemes for cross-validating ESNs and introduce an efficient algorithm for implementing them. This algorithm is presented as two levels of optimizations of doing $k$-fold CV. Training an RC model typically consists of two stages: (i) running the reservoir with the data and (ii) computing the optimal readouts. The first level of our optimization addresses the most computationally expensive part (i) and makes it remain constant irrespective of $k$. It dramatically reduces reservoir computations in any type of RC system and is enough if $k$ is small. The second level of optimization also makes the (ii) part remain constant irrespective of large $k$, as long as the dimension of the output is low. We discuss when the proposed validation schemes for ESNs could be beneficial, three options for producing the final model and empirically investigate them on six different real-world datasets, as well as do empirical computation time experiments. We provide the code in an online repository. Results: Proposed CV schemes give better and more stable test performance in all the six different real-world datasets, three task types. Empirical run times confirm our complexity analysis. Conclusions: In most situations $k$-fold CV of ESNs and many other RC models can be done for virtually the same time and space complexity as a simple single-split validation. This enables CV to become a standard practice in RC.
翻译:背景/感化:交叉估值(CV)在时间序列模型中仍然少见。回声状态网络(ESNs)作为RESOV(RC)模型的一个典型范例,以快速和精确的一拍学习而著称,往往受益于良好的超参数调。这使得它们理想地改变现状。方法:我们讨论用于预测具体时间间隔的时间序列(CV),以预测具体时间间隔;建议多项交叉验证 ESNS 计划,并引入一个高效的计算方法来实施它们。这一算法以两个水平优化美元乘以 CV。培训RC模型通常由两个阶段组成:(一) 以数据运行储油库和(二) 计算最佳读数。这使他们最理想地适应于计算最昂贵的部分(一), 使其保持不变。这极大地降低了任何类型RCSN系统对储油量的计算方法, 并且足够低值的计算方法。第二层优化也使得(二) 部分保持不变,而不论以美元计算的是多少美元或多少个在线情况。在模型中,SWSUR 的六种快速的运算算算算算算得更精确的。