Anomaly detection in time series has been widely researched and has important practical applications. In recent years, anomaly detection algorithms are mostly based on deep-learning generative models and use the reconstruction error to detect anomalies. They try to capture the distribution of normal data by reconstructing normal data in the training phase, then calculate the reconstruction error of test data to do anomaly detection. However, most of them only use the normal data in the training phase and can not ensure the reconstruction process of anomaly data. So, anomaly data can also be well reconstructed sometimes and gets low reconstruction error, which leads to the omission of anomalies. What's more, the neighbor information of data points in time series data has not been fully utilized in these algorithms. In this paper, we propose RAN based on the idea of Reconstruct Anomalies to Normal and apply it for unsupervised time series anomaly detection. To minimize the reconstruction error of normal data and maximize this of anomaly data, we do not just ensure normal data to reconstruct well, but also try to make the reconstruction of anomaly data consistent with the distribution of normal data, then anomalies will get higher reconstruction errors. We implement this idea by introducing the "imitated anomaly data" and combining a specially designed latent vector-constrained Autoencoder with the discriminator to construct an adversary network. Extensive experiments on time-series datasets from different scenes such as ECG diagnosis also show that RAN can detect meaningful anomalies, and it outperforms other algorithms in terms of AUC-ROC.
翻译:在时间序列中异常的检测已经进行了广泛的研究,并且具有重要的实际应用。近年来,异常的检测算法大多以深学习的基因模型为基础,并且使用重建错误来检测异常。它们试图通过重建培训阶段的正常数据来获取正常数据的分布,然后计算测试数据的重建错误以进行异常的检测。然而,它们大多只使用培训阶段的正常数据,无法确保异常数据的重建过程。因此,异常数据有时也可以得到很好的重建,并且得到低重建错误,从而导致异常现象的消失。此外,时间序列数据中的数据点的相邻信息在这些算法中没有得到充分利用。在本文件中,我们根据重新构建正常数据的理念提出“ARAN ”, 并将其用于不受控制的时间序列的异常检测。为了尽可能减少正常数据的重建错误,我们不仅确保正常的数据得到很好的重建,而且能够使异常数据的重建与正常数据的传播相一致,然后异常现象将更高的时间序列数据数据用于更高级的重建数据序列中。我们根据重新构建的ARAN, 将这种数据引入一个“A级化的模型 ”, 并用一个不甚深层的模型来演示一个“A级的模型, 。