预测未来的变异器。在下一个框架和时间序列预测中注意 (Transformers predicting the future. Applying attention in next-frame and time series forecasting)

Recurrent Neural Networks were, until recently, one of the best ways to capture the timely dependencies in sequences. However, with the introduction of the Transformer, it has been proven that an architecture with only attention-mechanisms without any RNN can improve on the results in various sequence processing tasks (e.g. NLP). Multiple studies since then have shown that similar approaches can be applied for images, point clouds, video, audio or time series forecasting. Furthermore, solutions such as the Perceiver or the Informer have been introduced to expand on the applicability of the Transformer. Our main objective is testing and evaluating the effectiveness of applying Transformer-like models on time series data, tackling susceptibility to anomalies, context awareness and space complexity by fine-tuning the hyperparameters, preprocessing the data, applying dimensionality reduction or convolutional encodings, etc. We are also looking at the problem of next-frame prediction and exploring ways to modify existing solutions in order to achieve higher performance and learn generalized knowledge.

翻译：经常的神经网络直到最近一直是在序列中捕捉及时依赖性的最佳方法之一。然而,随着变异器的引入,事实证明,一个只有注意机制而没有任何RNN的架构可以改进各种序列处理任务(如NLP)的结果。此后的多项研究表明,对图像、点云、视频、音频或时间序列的预测可以采用类似的方法。此外,还引入了 Perceiver或Intin等解决方案来扩大变异器的适用性。我们的主要目标是测试和评价在时间序列数据中应用类似变异器的模型的有效性,通过微调超分计、预处理数据、应用维度减法或变动编码等处理易变、环境意识和空间复杂性。我们还在研究下一个框架的预测问题,并探索修改现有解决方案的方法,以提高绩效和学习普遍知识。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

如何预测序列？看这份IJCAI2021亚马逊《大时间序列预测》教程，附301页Slides

专知会员服务

114+阅读 · 2021年8月20日

注意力机制综述

专知会员服务

207+阅读 · 2021年1月26日

最新《Transformers模型》教程，64页ppt

专知会员服务

320+阅读 · 2020年11月26日

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

专知会员服务

111+阅读 · 2020年6月10日