Counterfactual explanations are a popular type of explanation for making the outcomes of a decision making system transparent to the user. Counterfactual explanations tell the user what to do in order to change the outcome of the system in a desirable way. However, it was recently discovered that the recommendations of what to do can differ significantly in their complexity between protected groups of individuals. Providing more difficult recommendations of actions to one group leads to a disadvantage of this group compared to other groups. In this work we propose a model-agnostic method for computing counterfactual explanations that do not differ significantly in their complexity between protected groups.
翻译:反事实解释是使决策系统的结果对用户透明的一种流行解释。反事实解释告诉用户应做些什么,以便以可取的方式改变系统的结果。然而,最近发现,关于应做什么的建议在受保护的个人群体之间的复杂性可能大相径庭。向一个群体提出更困难的行动建议,导致该群体与其他群体相比处于不利地位。在这项工作中,我们提出了计算反事实解释的模型----不可知性方法,在受保护群体之间,其复杂性没有多大差别。