Recurrent neural networks (RNNs) have brought a lot of advancements in sequence labeling tasks and sequence data. However, their effectiveness is limited when the observations in the sequence are irregularly sampled, where the observations arrive at irregular time intervals. To address this, continuous time variants of the RNNs were introduced based on neural ordinary differential equations (NODE). They learn a better representation of the data using the continuous transformation of hidden states over time, taking into account the time interval between the observations. However, they are still limited in their capability as they use the discrete transformations and a fixed discrete number of layers (depth) over an input in the sequence to produce the output observation. We intend to address this limitation by proposing RNNs based on differential equations which model continuous transformations over both depth and time to predict an output for a given input in the sequence. Specifically, we propose continuous depth recurrent neural differential equations (CDR-NDE) which generalizes RNN models by continuously evolving the hidden states in both the temporal and depth dimensions. CDR-NDE considers two separate differential equations over each of these dimensions and models the evolution in the temporal and depth directions alternatively. We also propose the CDR-NDE-heat model based on partial differential equations which treats the computation of hidden states as solving a heat equation over time. We demonstrate the effectiveness of the proposed models by comparing against the state-of-the-art RNN models on real world sequence labeling problems and data.
翻译:经常神经网络(RNNS)在标记任务和序列数据的顺序排列方面带来了许多进步;然而,当序列中的观测是非定期抽样时,其效力是有限的,因为观测是在不定期的时间间隔下进行的,为了解决这个问题,根据神经普通差异方程式(NODE)引入了连续的时间变异。它们学会了利用隐藏国家随时间变化的连续变换来更好地表述数据,同时考虑到观察之间的时间间隔。但是,它们的能力仍然有限,因为它们使用离散变换和固定的离散层数(深度)在生成输出观察的序列中输入一个输入。我们打算解决这一限制,办法是根据不同方程式提出RNNNS的连续变换,这些变换以深度和时间的模型为基础,以预测序列中某项输入的输出输出输出。具体地说,我们建议持续深度的反复变异变方方方程式(CDNNE),通过在时间和深度方面的变化中不断改变隐藏状态。CDR-NDE考虑两个不同的变异方程式,这些变异方程式针对每个实际的变异方程式的模型,我们以时程和部分变式的变式在时间和变式计算中,也用C-C-C-C-rodeal-C-C-C-C-C-ro化的变式的变式的变式的变式的变式的变式的变式模型,在时间和制的变式计算。