Explainable artificial intelligence (XAI) methods shed light on the predictions of deep neural networks (DNNs). Several approaches exist, and some have already been applied successfully in climate science. However, ground-truth explanations are often missing, which complicates the evaluation and validation of these methods and, in turn, the choice of an XAI method. Therefore, in this work, we introduce XAI evaluation in the context of climate research and assess different desired explanation properties, namely robustness, faithfulness, randomization, complexity, and localization. To this end, we build upon previous work and train a multi-layer perceptron (MLP) and a convolutional neural network (CNN) to predict the decade based on annual-mean temperature maps. Next, multiple local XAI methods are applied, and their performance is quantified for each evaluation property and compared against a baseline test. Independent of the network type, we find that the XAI methods Integrated Gradients, Layer-wise Relevance Propagation, and InputGradients exhibit considerable robustness, faithfulness, and complexity while sacrificing randomization. The opposite is true for Gradient, SmoothGrad, NoiseGrad, and FusionGrad. Notably, explanations using input perturbations, such as SmoothGrad and Integrated Gradients, do not improve robustness and faithfulness, contrary to previous claims. Overall, our experiments offer a comprehensive overview of different properties of explanation methods in the climate science context and support users in the selection of a suitable XAI method.
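To make the difference between some of the gradient-based methods named in the abstract concrete, the following is a minimal sketch and not the study's implementation. It assumes a toy scalar model (a tanh of a linear readout with hypothetical weights `w`) in place of the MLP or CNN, so that the input gradient can be written analytically. All function names, the noise level `sigma`, the sample count, and the number of integration steps are illustrative assumptions.

```python
import numpy as np


def model(x, w):
    """Toy scalar 'network' (hypothetical stand-in for the MLP/CNN)."""
    return np.tanh(x @ w)


def gradient(x, w):
    """Gradient explanation: derivative of the output w.r.t. each input."""
    return (1.0 - np.tanh(x @ w) ** 2) * w


def input_times_gradient(x, w):
    """Input-times-gradient explanation: gradient weighted by the input."""
    return x * gradient(x, w)


def smoothgrad(x, w, rng, n_samples=50, sigma=0.1):
    """SmoothGrad: average the gradient over Gaussian-perturbed inputs."""
    noisy = x + sigma * rng.standard_normal((n_samples, x.size))
    return np.mean([gradient(z, w) for z in noisy], axis=0)


def integrated_gradients(x, w, baseline=None, steps=200):
    """Integrated Gradients along a straight path from baseline to x."""
    if baseline is None:
        baseline = np.zeros_like(x)  # assumed all-zero reference input
    # Midpoint rule approximation of the path integral over alpha in [0, 1].
    alphas = (np.arange(steps) + 0.5) / steps
    path_grad = np.mean(
        [gradient(baseline + a * (x - baseline), w) for a in alphas], axis=0
    )
    return (x - baseline) * path_grad


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.standard_normal(16)        # flattened toy "temperature map"
    w = rng.standard_normal(16) / 4.0  # hypothetical model weights
    for name, expl in [
        ("Gradient", gradient(x, w)),
        ("InputGradients", input_times_gradient(x, w)),
        ("SmoothGrad", smoothgrad(x, w, rng)),
        ("Integrated Gradients", integrated_gradients(x, w)),
    ]:
        print(name, np.round(expl[:3], 3))
```

With a zero baseline, the Integrated Gradients attributions should sum approximately to the difference between the model output at the input and at the baseline (the completeness property), which gives a quick sanity check on the sketch.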