In recent years, several algorithms for system identification with neural state-space models have been introduced. Most of the proposed approaches are aimed at reducing the computational complexity of the learning problem, by splitting the optimization over short sub-sequences extracted from a longer training dataset. Different sequences are then processed simultaneously within a minibatch, taking advantage of modern parallel hardware for deep learning. An issue arising in these methods is the need to assign an initial state for each of the sub-sequences, which is required to run simulations and thus to evaluate the fitting loss. In this paper, we provide insights for calibration of neural state-space training algorithms based on extensive experimentation and analyses performed on two recognized system identification benchmarks. Particular focus is given to the choice and the role of the initial state estimation. We demonstrate that advanced initial state estimation techniques are really required to achieve high performance on certain classes of dynamical systems, while for asymptotically stable ones basic procedures such as zero or random initialization already yield competitive performance.
翻译:近年来,引入了多种神经状态-空间模型系统识别算法,大多数拟议方法旨在降低学习问题的计算复杂性,将优化与从长期培训数据集中抽取的短次序列分开,从而将优化与短次序列分开。随后,利用现代平行的深层学习硬件,在小型批中同时处理不同的序列。这些方法中产生的一个问题是,需要为每个子序列指定一个初始状态,这是进行模拟并评估适当损失所必需的。在本文件中,我们根据广泛的实验和对两个公认的系统识别基准进行的分析,为神经状态-空间培训算法的校准提供了深刻见解。特别侧重于初步状态估算的选择和作用。我们表明,为了在某些动态系统类别实现高性能,确实需要先进的初始状态估算技术,而对于诸如零或随机初始化等不稳的基本程序来说,则需要具有竞争性性的工作。