The exploding research interest for neural networks in modeling nonlinear dynamical systems is largely explained by the networks' capacity to model complex input-output relations directly from data. However, they typically need vast training data before they can be put to any good use. The data generation process for dynamical systems can be an expensive endeavor both in terms of time and resources. Active learning addresses this shortcoming by acquiring the most informative data, thereby reducing the need to collect enormous datasets. What makes the current work unique is integrating the deep active learning framework into nonlinear system identification. We formulate a general static deep active learning acquisition problem for nonlinear system identification. This is enabled by exploring system dynamics locally in different regions of the input space to obtain a simulated dataset covering the broader input space. This simulated dataset can be used in a static deep active learning acquisition scheme referred to as global explorations. The global exploration acquires a batch of initial states corresponding to the most informative state-action trajectories according to a batch acquisition function. The local exploration solves an optimal control problem, finding the control trajectory that maximizes some measure of information. After a batch of informative initial states is acquired, a new round of local explorations from the initial states in the batch is conducted to obtain a set of corresponding control trajectories that are to be applied on the system dynamics to get data from the system. Information measures used in the acquisition scheme are derived from the predictive variance of an ensemble of neural networks. The novel method outperforms standard data acquisition methods used for system identification of nonlinear dynamical systems in the case study performed on simulated data.
翻译:神经网络在建模非线性动态系统方面的研究兴趣激增,主要是因为这些网络有能力直接从数据中建模复杂的输入-输出关系。 但是,它们通常需要大量的培训数据才能被很好地使用。 动态系统的数据生成过程在时间和资源两方面都是一种昂贵的努力。 积极学习通过获取信息量最大的数据来弥补这一缺陷,从而减少了收集巨大数据集的需要。 使当前工作独特之处在于将深度主动学习框架纳入非线性动态系统识别。 我们为非线性系统识别开发一个一般静态的深层学习获取问题。 之所以能够做到这一点,是因为在输入空间的不同区域探索一个模拟的系统动态数据,以获得覆盖更广泛的输入空间的数据集。 这个模拟数据集可用于一个静态的深度学习获取计划,称为全球探索。 全球探索获得了一系列与最信息量的州-行动轨迹轨迹相对应的初始状态, 本地勘探解决了一个最佳的控制问题, 找到从本地系统对非线性内线性数据排序的某种控制轨迹。 在使用初始数据采集方法后, 将进行初步数据采集到本地数据采集系统。</s>