共同关注的多任务时间序列预报 (Multi-Task Time Series Forecasting With Shared Attention)

Time series forecasting is a key component in many industrial and business decision processes and recurrent neural network (RNN) based models have achieved impressive progress on various time series forecasting tasks. However, most of the existing methods focus on single-task forecasting problems by learning separately based on limited supervised objectives, which often suffer from insufficient training instances. As the Transformer architecture and other attention-based models have demonstrated its great capability of capturing long term dependency, we propose two self-attention based sharing schemes for multi-task time series forecasting which can train jointly across multiple tasks. We augment a sequence of paralleled Transformer encoders with an external public multi-head attention function, which is updated by all data of all tasks. Experiments on a number of real-world multi-task time series forecasting tasks show that our proposed architectures can not only outperform the state-of-the-art single-task forecasting baselines but also outperform the RNN-based multi-task forecasting method.

翻译：时间序列预测是许多工业和商业决策过程的一个关键组成部分,基于经常神经网络的模型在各种时间序列预测任务方面取得了令人印象深刻的进展,然而,大多数现有方法侧重于单任务预测问题,根据有限的监督目标分别学习,而这些目标往往缺乏足够的培训。由于变换器结构和其他关注模型显示了它捕捉长期依赖性的巨大能力,我们提议了两种基于自我注意的多任务时间序列预测共享计划,可以对多个任务进行联合培训。我们增加了一系列平行的变换器编码器与外部公共多头关注功能的序列,由所有任务的数据加以更新。关于一些现实世界多任务时间序列预测任务的实验表明,我们拟议的结构不仅能够超越最先进的单一任务预测基线,而且能够超越基于RNN的多任务预测方法。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

【斯坦福大学课程】2021年深度多任务学习与元学习，CS 330: Deep Multi-Task and Meta Learning

专知会员服务

110+阅读 · 2022年3月2日

最新《弱监督预训练语言模型微调》报告，52页ppt

专知会员服务

38+阅读 · 2020年12月26日

【2020干货书】Python监督学习，387页pdf，使用Python的概念和实践

专知会员服务

72+阅读 · 2020年10月11日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

67+阅读 · 2020年7月25日