Neural networks inspired by differential equations have proliferated over the past several years. Neural ordinary differential equations (NODEs) and neural controlled differential equations (NCDEs) are two representative examples. In theory, NCDEs offer better representation-learning capability for time-series data than NODEs; in particular, NCDEs are known to be well suited to processing irregular time-series data. While NODEs have been successfully extended by adopting attention, however, how to integrate attention into NCDEs has not yet been studied. To this end, we present Attentive Neural Controlled Differential Equations (ANCDEs) for time-series classification and forecasting, in which dual NCDEs are used: one for generating attention values and the other for evolving hidden vectors for a downstream machine learning task. We conduct experiments with three real-world time-series datasets and 10 baselines. After dropping some values, we also conduct irregular time-series experiments. Our method consistently shows the best accuracy in all cases by non-trivial margins. Our visualizations also show that the presented attention mechanism works as intended by focusing on crucial information.
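To make the dual-NCDE idea concrete, the following is a minimal sketch of one Euler-discretized step structure, not the authors' implementation: a bottom NCDE evolves a state `z` driven by the control path `X` and reads out attention values through a sigmoid, and a top NCDE evolves the hidden vector `h` driven by the attention-modulated path increments. All weights and the path itself are random stand-ins for what would be learned networks and an interpolated time series.

```python
import numpy as np

rng = np.random.default_rng(0)
T, dx, dh = 20, 3, 4  # time steps, control-path dim, hidden dim

# Interpolated control path X(t) built from an observed time series (toy data here)
X = np.cumsum(rng.normal(size=(T, dx)), axis=0)

# Random linear stand-ins for the learned CDE functions f_z: R^dx -> R^(dx x dx)
# and f_h: R^dh -> R^(dh x dx)
Wz = rng.normal(scale=0.1, size=(dx * dx, dx))
bz = rng.normal(scale=0.1, size=(dx * dx,))
Wh = rng.normal(scale=0.1, size=(dh * dx, dh))
bh = rng.normal(scale=0.1, size=(dh * dx,))

z = np.zeros(dx)   # bottom NCDE state (produces attention)
h = np.zeros(dh)   # top NCDE hidden vector (for the downstream task)
for t in range(1, T):
    dX = X[t] - X[t - 1]
    # Bottom NCDE: dz = f_z(z) dX
    fz = np.tanh(Wz @ z + bz).reshape(dx, dx)
    z = z + fz @ dX
    # Attention values in (0, 1), one per control-path channel
    a = 1.0 / (1.0 + np.exp(-z))
    # Top NCDE: dh = f_h(h) (a * dX) -- attention gates the path increments
    fh = np.tanh(Wh @ h + bh).reshape(dh, dx)
    h = h + fh @ (a * dX)
```

In practice both CDE functions would be trained neural networks and the integral would be computed with an adaptive ODE/CDE solver over a spline-interpolated path rather than this fixed-step Euler loop.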