This paper considers an anomaly detection problem in which a detection algorithm assigns anomaly scores to multi-dimensional data points, such as cellular networks' Key Performance Indicators (KPIs). We propose an optimization framework to refine these anomaly scores by leveraging side information in the form of a causality graph between the various features of the data points. The refinement block builds on causality theory and a proposed notion of confidence scores. After motivating our framework, smoothness properties are proved for the ensuing mathematical expressions. Next, equipped with these results, a gradient descent algorithm is proposed, and a proof of its convergence to a stationary point is provided. Our results hold (i) for any causal anomaly detection algorithm and (ii) for any side information in the form of a directed acyclic graph. Numerical results are provided to illustrate the advantage of our proposed framework in dealing with False Positives (FPs) and False Negatives (FNs). Additionally, the effect of the graph's structure on the expected performance advantage and the various trade-offs that take place are analyzed.
翻译:本文考虑异常检测问题,其中检测算法为多维数据点(如蜂窝网络关键性能指标)分配异常得分。 我们提出了一个优化框架,通过利用因果关系图来推广这些异常得分。 精化块基于因果论和提出的置信度得分的概念。 在阐明我们的框架后,对所得的数学表达式证明了光滑性质。 接下来,在这些结果的支持下,提出了一种梯度下降算法,并提供了其收敛到稳定点的证明。 我们的结果适用于(i)任何因果异常检测算法和(ii)以有向无环图形式的任何附加信息。 提供了数值结果,以说明我们提出的框架在处理误报和漏报方面的优势。 此外,分析了图形结构对预期性能提高和各种权衡的影响。