In the research area of anomaly detection, novel and promising methods are frequently developed. However, most existing studies exclusively focus on the detection task only and ignore the interpretability of the underlying models as well as their detection results. However, anomaly interpretation, which aims to provide explanation of why specific data instances are identified as anomalies, is an equally important task in many real-world applications. In this work, we propose a novel framework which synergizes several machine learning and data mining techniques to automatically learn invariant rules that are consistently satisfied in the training data. The learned invariant rules can provide explicit explanation of anomaly detection results and thus are extremely useful for subsequent decision-making regarding reported anomalies. Furthermore, our empirical evaluation shows that the proposed method can also achieve comparable or even better performance in terms of AUC and partial AUC on public benchmark datasets across various application domains compared with start-of-the-art anomaly detection models.
翻译:在发现异常现象的研究领域,经常开发出新颖和有希望的方法,然而,大多数现有研究只侧重于探测任务,忽视基本模型及其检测结果的可解释性;然而,异常现象解释旨在解释为什么具体数据被确定为异常现象,在许多现实世界应用中,这是同样重要的任务;在这项工作中,我们提出了一个新颖的框架,使若干机器学习和数据挖掘技术协同起来,自动学习在培训数据中始终满足的不变规则;所学的不规则可以对异常现象检测结果作出明确解释,从而对随后就所报告的异常现象作出决定极为有用;此外,我们的经验评估表明,拟议的方法还可以在奥地利联合自卫军和部分奥地利联合自卫军方面,在各种应用领域的公共基准数据集方面实现可比或甚至更好的业绩,而与最初的异常现象检测模型相比。