Enhancing the expressive capacity of deep learning-based time series models with self-supervised pre-training has become increasingly prevalent in time series classification. Although numerous efforts have been devoted to developing self-supervised models for time series data, we argue that current methods are insufficient for learning optimal time series representations because they rely solely on unidirectional encoding over sparse point-wise input units. In this work, we propose TimeMAE, a novel self-supervised paradigm for learning transferable time series representations based on transformer networks. The distinct characteristics of TimeMAE lie in processing each time series into a sequence of non-overlapping sub-series via window-slicing partitioning, followed by a random masking strategy over the semantic units of the localized sub-series. Such a simple yet effective setting helps us kill three birds with one stone, i.e., (1) learning enriched contextual representations of time series with a bidirectional encoding scheme; (2) increasing the information density of the basic semantic units; and (3) encoding representations of time series efficiently with transformer networks. Nevertheless, it is non-trivial to perform the reconstruction task over such a newly formulated modeling paradigm. To resolve the discrepancy issue incurred by the newly injected mask embeddings, we design a decoupled autoencoder architecture that learns the representations of visible (unmasked) positions and masked ones with two different encoder modules, respectively. Furthermore, we construct two types of informative targets to accomplish the corresponding pretext tasks. One is to create a tokenizer module that assigns a codeword to each masked region, allowing the masked codeword classification (MCC) task to be completed effectively...
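To make the window-slicing and masking step concrete, the following is a minimal sketch, assuming PyTorch; the helper names `window_slice` and `random_mask` and the default masking ratio are hypothetical, illustrating the operations the abstract describes rather than the paper's actual implementation.

```python
import torch

def window_slice(x: torch.Tensor, window: int) -> torch.Tensor:
    """Partition a batch of series (B, C, T) into non-overlapping
    sub-series of length `window`, giving (B, N, C * window) units."""
    B, C, T = x.shape
    N = T // window                       # number of sub-series units
    x = x[:, :, : N * window]             # drop the remainder
    x = x.reshape(B, C, N, window)        # (B, C, N, window)
    return x.permute(0, 2, 1, 3).reshape(B, N, C * window)

def random_mask(units: torch.Tensor, ratio: float = 0.6):
    """Randomly mask a fraction of sub-series units, returning the
    visible units, the masked units, and their index sets."""
    B, N, D = units.shape
    n_mask = int(N * ratio)
    perm = torch.rand(B, N).argsort(dim=1)        # random permutation per series
    mask_idx, vis_idx = perm[:, :n_mask], perm[:, n_mask:]
    batch = torch.arange(B).unsqueeze(1)
    return units[batch, vis_idx], units[batch, mask_idx], vis_idx, mask_idx
```

For instance, with `window = 8` over a length-512 series this yields 64 sub-series units, so a 0.6 ratio masks 38 of them; the visible units feed the encoder while the masked ones serve as prediction targets.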
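The decoupled two-encoder idea can be read as follows (a minimal PyTorch sketch, not the authors' implementation): the visible encoder never sees mask embeddings, and a separate cross-attention module lets learnable mask queries gather context from the visible representations, which is one plausible way to realize the "two different encoder modules" described above. The class name `DecoupledMAE` and all hyperparameters are illustrative assumptions.

```python
import torch
import torch.nn as nn

class DecoupledMAE(nn.Module):
    """Sketch of a decoupled masked autoencoder: visible units are
    encoded without any mask tokens; a separate module attends from
    learnable mask queries to the encoded visible context."""
    def __init__(self, dim: int, n_heads: int = 4, depth: int = 4):
        super().__init__()
        layer = nn.TransformerEncoderLayer(dim, n_heads, batch_first=True)
        self.visible_encoder = nn.TransformerEncoder(layer, depth)
        self.mask_token = nn.Parameter(torch.zeros(1, 1, dim))
        self.mask_decoder = nn.MultiheadAttention(dim, n_heads, batch_first=True)

    def forward(self, visible: torch.Tensor, n_masked: int):
        ctx = self.visible_encoder(visible)                    # (B, N_vis, dim)
        # Positional embeddings for the masked slots are omitted for brevity.
        queries = self.mask_token.expand(ctx.size(0), n_masked, -1)
        masked_repr, _ = self.mask_decoder(queries, ctx, ctx)  # cross-attention
        return ctx, masked_repr
```

Keeping mask embeddings out of the visible encoder is what addresses the discrepancy issue: the contextual representations used downstream are never conditioned on artificial mask tokens, while the masked-position representations produced by the second module can be trained against the codeword targets of the MCC task.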