Convolutional neural networks (CNNs) have recently attracted great attention in geoscience due to their ability to capture non-linear system behavior and extract predictive spatiotemporal patterns. Given their black-box nature however, and the importance of prediction explainability, methods of explainable artificial intelligence (XAI) are gaining popularity as a means to explain the CNN decision-making strategy. Here, we establish an intercomparison of some of the most popular XAI methods and investigate their fidelity in explaining CNN decisions for geoscientific applications. Our goal is to raise awareness of the theoretical limitations of these methods and gain insight into the relative strengths and weaknesses to help guide best practices. The considered XAI methods are first applied to an idealized attribution benchmark, where the ground truth of explanation of the network is known a priori, to help objectively assess their performance. Secondly, we apply XAI to a climate-related prediction setting, namely to explain a CNN that is trained to predict the number of atmospheric rivers in daily snapshots of climate simulations. Our results highlight several important issues of XAI methods (e.g., gradient shattering, inability to distinguish the sign of attribution, ignorance to zero input) that have previously been overlooked in our field and, if not considered cautiously, may lead to a distorted picture of the CNN decision-making strategy. We envision that our analysis will motivate further investigation into XAI fidelity and will help towards a cautious implementation of XAI in geoscience, which can lead to further exploitation of CNNs and deep learning for prediction problems.
翻译:地球科学最近由于有能力捕捉非线性系统行为和提取预测的时空模式而引起地球科学的极大关注。然而,鉴于黑箱的性质以及预测解释的重要性,解释人工智能(XAI)的方法越来越受欢迎,作为解释CNN决策战略的一种手段。在这里,我们建立一些最受欢迎的XAI方法的相互比较,并调查这些方法在解释CNN地球科学应用决定方面的忠贞性。我们的目标是提高对这些方法理论局限性的认识,并深入了解这些方法的相对优缺点,以帮助指导最佳做法。考虑的XAI方法首先应用于一个理想化的归属基准,在这个基准中,人们事先知道对网络的解释的地面真相,以帮助客观地评估其表现。第二,我们将XAI应用到一个气候相关预测环境,即解释CNN在为预测大气河流数量而培训时,在气候模拟的每日快照中将进一步帮助预测大气河流数量。我们的结果突出了XAI方法的若干重要问题(例如,梯级粉碎、无法解释网络的地面真相分析,如果我们没有被考虑的实地分析,那么,我们会将判断错误的实地分析会分化。