With a variety of local feature attribution methods being proposed in recent years, follow-up work suggested several evaluation strategies. To assess the attribution quality across different attribution techniques, the most popular among these evaluation strategies in the image domain use pixel perturbations. However, recent advances discovered that different evaluation strategies produce conflicting rankings of attribution methods and can be prohibitively expensive to compute. In this work, we present an information-theoretic analysis of evaluation strategies based on pixel perturbations. Our findings reveal that the results are strongly affected by information leakage through the shape of the removed pixels as opposed to their actual values. Using our theoretical insights, we propose a novel evaluation framework termed Remove and Debias (ROAD) which offers two contributions: First, it mitigates the impact of the confounders, which entails higher consistency among evaluation strategies. Second, ROAD does not require the computationally expensive retraining step and saves up to 99% in computational costs compared to the state-of-the-art. We release our source code at https://github.com/tleemann/road_evaluation.
翻译:由于近年来提出了各种地方特性归属方法,后续工作提出了若干评价战略。为了评估不同属性技术的归属质量,在图像域中最受欢迎的评价战略使用像素扰动。然而,最近的进展发现,不同的评价战略产生不同属性方法的等级冲突,而且计算成本过高。在这项工作中,我们对基于像素扰动的评价战略进行了信息理论分析。我们的调查结果显示,通过被移除像素的形状而不是其实际价值,信息渗漏对结果产生了强烈的影响。我们利用我们的理论见解,提出了一个名为“删除”和“Debias(ROAD)”的新的评价框架,提供两种贡献:第一,它减轻了组合者的影响,这就要求评价战略之间更加一致。第二,ROAD不需要计算昂贵的再培训步骤,并且比国家艺术状态节省了高达99%的计算成本。我们发布了我们的源代码,网址是https://github.com/tlemann/rover_evation。