Fairness and explainability are two important and closely related requirements of decision making systems. While ensuring and evaluating fairness as well as explainability of decision masking systems has been extensively studied independently, only little effort has been investigated into studying fairness of explanations on their own - i.e. the explanations it self should be fair. In this work we formally and empirically study individual fairness and robustness of contrasting explanations - in particular we consider counterfactual explanations as a prominent instance of contrasting explanations. Furthermore, we propose to use plausible counterfactuals instead of closest counterfactuals for improving the individual fairness of counterfactual explanations.
翻译:公平和解释是决策系统两个重要和密切相关的要求。虽然已经独立地广泛研究过确保和评价公正以及决定遮掩系统的可解释性,但对研究解释本身是否公平,即解释本身应当公平,只调查了很少的努力。在这项工作中,我们正式和实证地研究个人公正和对比解释的稳健性,特别是我们认为反事实解释是对比解释的一个突出例子。此外,我们提议使用可信的反事实,而不是最接近的反事实来改进反事实解释的个人公平性。