In anomaly detection, the degree of irregularity is often summarized as a real-valued anomaly score. We address the problem of attributing such anomaly scores to input features for interpreting the results of anomaly detection. We particularly investigate the use of the Shapley value for attributing anomaly scores of semi-supervised detection methods. We propose a characteristic function specifically designed for attributing anomaly scores. The idea is to approximate the absence of some features by locally minimizing the anomaly score with regard to the to-be-absent features. We examine the applicability of the proposed characteristic function and other general approaches for interpreting anomaly scores on multiple datasets and multiple anomaly detection methods. The results indicate the potential utility of the attribution methods including the proposed one.
翻译:在检测异常时,非正常程度往往被概括为实际价值的异常分数。我们处理将异常分数归为解释异常分数结果的投入特征的问题。我们特别调查使用沙普利值来分配半监督的检测方法异常分数的问题。我们建议了专门为分错异常分数而设计的特性性功能。目的是通过在当地尽量减少与待缺特征有关的异常分数来估计某些缺分数。我们研究了拟议特性功能和其他一般性方法在解释多个数据集和多异常分数方法上的异常分数时的适用性。结果显示了归属方法的潜在效用,包括拟议特性方法。