With a variety of local feature attribution methods being proposed in recent years, follow-up work suggested several evaluation strategies. To assess the attribution quality across different attribution techniques, the most popular among these evaluation strategies in the image domain use pixel perturbations. However, recent advances discovered that different evaluation strategies produce conflicting rankings of attribution methods and can be prohibitively expensive to compute. In this work, we present an information-theoretic analysis of evaluation strategies based on pixel perturbations. Our findings reveal that the results output by different evaluation strategies are strongly affected by information leakage through the shape of the removed pixels as opposed to their actual values. Using our theoretical insights, we propose a novel evaluation framework termed Remove and Debias (ROAD) which offers two contributions: First, it mitigates the impact of the confounders, which entails higher consistency among evaluation strategies. Second, ROAD does not require the computationally expensive retraining step and saves up to 99% in computational costs compared to the state-of-the-art. Our source code is available at https://github.com/tleemann/road_evaluation.
翻译:近些年来,由于提出了各种地方特性归属方法,后续工作建议了若干评价战略。为了评估不同属性技术的归属质量,在图像域中最受欢迎的评价战略使用像素扰动。然而,最近的进展发现,不同的评价战略产生不同的归属方法等级冲突,而且计算成本过高。在这项工作中,我们对以像素扰动为基础的评价战略进行了信息理论分析。我们的调查结果显示,不同评价战略的结果产出受到信息渗漏的严重影响,信息渗漏是通过被移除的像素的形状,而不是它们的实际价值。我们利用我们的理论见解,提出了一个称为 " 移除 " 和 " Debials(ROAD)的新的评价框架,它提供两种贡献:第一,它减轻了组合者的影响,这就要求评价战略之间更加一致。第二,ROAD不需要计算昂贵的再培训步骤,并且比州-艺术的计算成本节省到99%。我们的源代码可在https://github.com/tleen/road_view中查阅。