Anomaly detection is a widely studied task for a broad variety of data types; among them, multiple time series appear frequently in applications, including for example, power grids and traffic networks. Detecting anomalies for multiple time series, however, is a challenging subject, owing to the intricate interdependencies among the constituent series. We hypothesize that anomalies occur in low density regions of a distribution and explore the use of normalizing flows for unsupervised anomaly detection, because of their superior quality in density estimation. Moreover, we propose a novel flow model by imposing a Bayesian network among constituent series. A Bayesian network is a directed acyclic graph (DAG) that models causal relationships; it factorizes the joint probability of the series into the product of easy-to-evaluate conditional probabilities. We call such a graph-augmented normalizing flow approach GANF and propose joint estimation of the DAG with flow parameters. We conduct extensive experiments on real-world datasets and demonstrate the effectiveness of GANF for density estimation, anomaly detection, and identification of time series distribution drift.
翻译:异常探测是广泛研究的多种数据类型的任务;其中,多个时间序列在应用中经常出现,例如电网和交通网络。但是,由于组成序列之间错综复杂的相互依存性,发现多个时间序列的异常是一个具有挑战性的主题。我们假设异常出现在分布的低密度区域,并探索利用正常流来进行不受监督的异常检测,因为其密度估计质量较高。此外,我们提出一个新的流动模型,在组成序列中强制设置一个巴伊西亚网络。一个巴伊西亚网络是一个定向的循环图(DAG),用来模拟因果关系;它将该序列的共同概率纳入易于评价的有条件概率产品中。我们称之为图表式的正常流动方法GANF,并提议用流量参数对DAG进行联合估计。我们对真实世界数据集进行广泛的实验,并展示GNF在密度估计、异常检测和确定时间序列分布流流中的有效性。