While a lot of research in explainable AI focuses on producing effective explanations, less work is devoted to the question of how people understand and interpret the explanation. In this work, we focus on this question through a study of saliency-based explanations over textual data. Feature-attribution explanations of text models aim to communicate which parts of the input text were more influential than others towards the model decision. Many current explanation methods, such as gradient-based or Shapley value-based methods, provide measures of importance which are well-understood mathematically. But how does a person receiving the explanation (the explainee) comprehend it? And does their understanding match what the explanation attempted to communicate? We empirically investigate the effect of various factors of the input, the feature-attribution explanation, and visualization procedure, on laypeople's interpretation of the explanation. We query crowdworkers for their interpretation on tasks in English and German, and fit a GAMM model to their responses considering the factors of interest. We find that people often mis-interpret the explanations: superficial and unrelated factors, such as word length, influence the explainees' importance assignment despite the explanation communicating importance directly. We then show that some of this distortion can be attenuated: we propose a method to adjust saliencies based on model estimates of over- and under-perception, and explore bar charts as an alternative to heatmap saliency visualization. We find that both approaches can attenuate the distorting effect of specific factors, leading to better-calibrated understanding of the explanation.
翻译:在可解释的AI中,大量研究侧重于提供有效解释,但较少的工作致力于人们如何理解和解释解释解释的问题。在这项工作中,我们通过对基于文字数据的突出解释的研究来关注这一问题。文本模型的特征推理解释旨在交流投入文本中哪些部分对示范决定的影响大于其他部分。许多当前的解释方法,如梯度法或光滑价值法等,提供了在数学上非常清楚的衡量标准。但是,接受解释的人(解释者)如何理解它?他们的理解与解释试图传达的内容相符吗?我们从经验上调查了投入的各种因素、特征推理解释和直观化程序对外行人解释模型对示范决定的影响。我们询问观众如何解释英语和德语的任务,并且让GAMMM模型与其考虑到利益因素的反应相适应。我们发现,人们常常会错误地理解这些解释:表面和不相关的因素,如字长度法解释,他们的理解是否与解释相符?我们从这些角度来研究这些解释解释解释的重要性:我们可以通过解释方式来更好地解释这些解释,尽管我们直接解释这种推理方法。