Multivariate time series anomaly detection has been extensively studied under the semi-supervised setting, where a training dataset with all normal instances is required. However, preparing such a dataset is very laborious since each single data instance should be fully guaranteed to be normal. It is, therefore, desired to explore multivariate time series anomaly detection methods based on the dataset without any label knowledge. In this paper, we propose MTGFlow, an unsupervised anomaly detection approach for Multivariate Time series anomaly detection via dynamic Graph and entity-aware normalizing Flow, leaning only on a widely accepted hypothesis that abnormal instances exhibit sparse densities than the normal. However, the complex interdependencies among entities and the diverse inherent characteristics of each entity pose significant challenges on the density estimation, let alone to detect anomalies based on the estimated possibility distribution. To tackle these problems, we propose to learn the mutual and dynamic relations among entities via a graph structure learning model, which helps to model accurate distribution of multivariate time series. Moreover, taking account of distinct characteristics of the individual entities, an entity-aware normalizing flow is developed to describe each entity into a parameterized normal distribution, thereby producing fine-grained density estimation. Incorporating these two strategies, MTGFlowachieves superior anomaly detection performance. Experiments on the real-world datasets are conducted, demonstrating that MTGFlow outperforms the state-of-the-art (SOTA) by 5.0% and 1.6% AUROC for SWaT and WADI datasets respectively. Also, through the anomaly scores contributed by individual entities, MTGFlow can provide explanation information for the detection results.
翻译:在半监督环境下广泛研究了多变时间序列异常现象的检测,在半监督环境下需要有一个包含所有正常情况的培训数据集。然而,编制这样一个数据集非常费力,因为每个单一数据实例都应完全保证是正常的。因此,它希望探索基于数据集的多变时间序列异常现象检测方法,而没有任何标签知识。在本文件中,我们提议采用多变时间序列异常现象检测方法,即通过动态图表和实体认知正常流动,对多变时间序列异常现象检测方法不加监督,仅依靠一种广泛接受的假设,即异常情况比正常情况密度都低。然而,各实体之间复杂的相互依存关系和每个实体的不同固有特征应充分保证正常。为了解决这些问题,我们提议通过图表结构学习模型学习各实体之间的相互和动态关系,这有助于模拟多变时间序列的准确分布。 此外,考虑到各个实体的特性,实体对正常数据流的正常程度比正常。 各个实体的正常数据流将分别通过SGFOFMT的正常运行模式进行。