We describe a recurrent neural network (RNN) based architecture to learn the flow function of a causal, time-invariant and continuous-time control system from trajectory data. By restricting the class of control inputs to piecewise constant functions, we show that learning the flow function is equivalent to learning the input-to-state map of a discrete-time dynamical system. This motivates the use of an RNN together with encoder and decoder networks which map the state of the system to the hidden state of the RNN and back. We show that the proposed architecture is able to approximate the flow function by exploiting the system's causality and time-invariance. The output of the learned flow function model can be queried at any time instant. We experimentally validate the proposed method using models of the Van der Pol and FitzHugh Nagumo oscillators. In both cases, the results demonstrate that the architecture is able to closely reproduce the trajectories of these two systems. For the Van der Pol oscillator, we further show that the trained model generalises to the system's response with a prolonged prediction time horizon as well as control inputs outside the training distribution. For the FitzHugh-Nagumo oscillator, we show that the model accurately captures the input-dependent phenomena of excitability.
翻译:我们描述了一种基于递归神经网络(RNN)的体系结构,用于从轨迹数据中学习因果、时不变和连续时间控制系统的流函数。通过将控制输入类限制为分段恒定函数,我们展示了学习流函数等同于学习离散时间动力学系统的输入状态映射。这激发了利用RNN和编码器和解码器网络来实现状态转化为RNN的隐藏状态以及转化回原状态的方法。我们展示了所建议的体系结构能够利用系统的因果性和时不变性近似流函数。学习到的流函数模型的输出可以在任何时间点进行查询。我们使用Van Der Pol和FitzHugh Nagumo振荡器的模型进行实验验证。在两种情况下,结果表明该体系结构能够近似再现这两个系统的轨迹。对于Van Der Pol振荡器,我们进一步证明训练好的模型可以推广至系统在预测时间范围内的响应以及控制输入在训练分布之外的情况。对于FitzHugh-Nagumo振荡器,我们展示该模型准确捕捉到了兴奋性的输入相关现象。