Time series (TS) anomaly detection (AD) plays an essential role in various applications, e.g., fraud detection in finance and healthcare monitoring. Due to the inherently unpredictable and highly varied nature of anomalies and the lack of anomaly labels in historical data, the AD problem is typically formulated as an unsupervised learning problem. The performance of existing solutions is often not satisfactory, especially in data-scarce scenarios. To tackle this problem, we propose a novel self-supervised learning technique for AD in time series, namely \emph{DeepFIB}. We model the problem as a \emph{Fill In the Blank} game by masking some elements in the TS and imputing them with the rest. Considering the two common anomaly shapes (point- or sequence-outliers) in TS data, we implement two masking strategies with many self-generated training samples. The corresponding self-imputation networks can extract more robust temporal relations than existing AD solutions and effectively facilitate identifying the two types of anomalies. For continuous outliers, we also propose an anomaly localization algorithm that dramatically reduces AD errors. Experiments on various real-world TS datasets demonstrate that DeepFIB outperforms state-of-the-art methods by a large margin, achieving up to $65.2\%$ relative improvement in F1-score.
翻译:时间序列(TS)异常检测(AD)在各种应用中发挥着必不可少的作用,例如,在金融和医疗保健监测中发现欺诈。由于异常现象固有的不可预测性质和差异性很大,历史数据中缺乏异常标签,AD问题通常被视为一个不受监督的学习问题。现有解决方案的性能往往不尽人意,特别是在数据残缺的情景中。为了解决这一问题,我们提议了一种新的自动监督学习技术,用于实时序列中的反倾销,即:\emph{DeepFIB}。我们把问题模拟成一个\emph{Fill in Blank}游戏,掩盖TS中的某些元素,并将其与其余元素进行估算。考虑到TS数据中两种常见的异常形状(点或序列外出者),我们用许多自制的培训样本执行两种遮掩战略。相应的自我估计网络可以比现有的自动解决方案更牢固的时间关系,并有效地帮助识别两种异常类型。对于连续的外差,我们还建议一种异常的本地本地化算法性算法性,大大降低了FIFIFA的相对差位数。我们用各种实际状态来显示一个巨大的方法。