In the research area of anomaly detection, novel and promising methods are frequently developed. However, most existing studies exclusively focus on the detection task only and ignore the interpretability of the underlying models as well as their detection results. Nevertheless, anomaly interpretation, which aims to provide explanation of why specific data instances are identified as anomalies, is an equally important task in many real-world applications. In this work, we propose a novel framework which synergizes several machine learning and data mining techniques to automatically learn invariant rules that are consistently satisfied in a given dataset. The learned invariant rules can provide explicit explanation of anomaly detection results in the inference phase and thus are extremely useful for subsequent decision-making regarding reported anomalies. Furthermore, our empirical evaluation shows that the proposed method can also achieve comparable or even better performance in terms of AUC and partial AUC on public benchmark datasets across various application domains compared with start-of-the-art anomaly detection models.
翻译:在发现异常现象的研究领域,经常开发出新的和有希望的方法,然而,大多数现有研究只侧重于探测任务,忽视基本模型及其检测结果的可解释性,然而,异常解释旨在解释为什么具体数据被确定为异常现象,在许多现实世界应用中,这是同样重要的任务。在这项工作中,我们提出了一个新颖的框架,将若干机学和数据挖掘技术协同起来,自动学习在特定数据集中始终满足的不固定规则。所学的不固定规则可以明确解释推断阶段的异常现象检测结果,因此对随后就所报告异常现象作出决定极为有用。此外,我们的经验评估表明,拟议的方法还可以在AUC和部分ACU在各种应用领域的公共基准数据集方面实现可比或甚至更好的业绩,而与最初的异常现象探测模型相比。