Anomaly detection remains an open challenge in many application areas. While there are a number of available machine learning algorithms for detecting anomalies, analysts are frequently asked to take additional steps in reasoning about the root cause of the anomalies and form actionable hypotheses that can be communicated to business stakeholders. Without the appropriate tools, this reasoning process is time-consuming, tedious, and potentially error-prone. In this paper we present PIXAL, a visual analytics system developed following an iterative design process with professional analysts responsible for anomaly detection. PIXAL is designed to fill gaps in existing tools commonly used by analysts to reason with and make sense of anomalies. PIXAL consists of three components: (1) an algorithm that finds patterns by aggregating multiple anomalous data points using first-order predicates, (2) a visualization tool that allows the analyst to build trust in the algorithmically-generated predicates by performing comparative and counterfactual analyses, and (3) a visualization tool that helps the analyst generate and validate hypotheses by exploring which features in the data most explain the anomalies. Finally, we present the results of a qualitative observational study with professional analysts. These results of the study indicate that PIXAL facilitates the anomaly reasoning process, allowing analysts to make sense of anomalies and generate hypotheses that are meaningful and actionable to business stakeholders.
翻译:在许多应用领域,异常检测仍然是一个公开的挑战。虽然有许多用于检测异常现象的机器学习算法,但经常要求分析人员采取更多步骤,解释异常现象的根源,并形成可操作的假设,以便向商业利益攸关方传达。没有适当的工具,这一推理过程耗时、乏味和可能容易出错。本文介绍PIXAL,这是一个视觉分析系统,它是在与负责异常现象检测的专业分析员一起进行迭接设计过程后开发的。PIXAL旨在填补分析人员通常使用的现有工具中存在的差距,以便理解异常现象。 PIXAL由三个部分组成:(1) 算法通过利用一阶前置集多个异常数据点找到模式,(2) 视觉化工具,使分析人员能够通过进行比较和反事实分析,建立对逻辑生成的前提的信任,(3) 视觉化工具,帮助分析员通过探索数据中哪些特征最能解释异常现象。最后,我们向专业分析人员介绍定性观测研究结果。 PIX,与专业分析师一道,通过综合多个异常点,发现模式发现模式,使分析师能够产生有意义的分析结果。