Counterfactual explanations (CFE) are being widely used to explain algorithmic decisions, especially in consequential decision-making contexts (e.g., loan approval or pretrial bail). In this context, CFEs aim to provide individuals affected by an algorithmic decision with the most similar individual (i.e., nearest individual) with a different outcome. However, while an increasing number of works propose algorithms to compute CFEs, such approaches either lack in optimality of distance (i.e., they do not return the nearest individual) and perfect coverage (i.e., they do not provide a CFE for all individuals); or they cannot handle complex models, such as neural networks. In this work, we provide a framework based on Mixed-Integer Programming (MIP) to compute nearest counterfactual explanations with provable guarantees and with runtimes comparable to gradient-based approaches. Our experiments on the Adult, COMPAS, and Credit datasets show that, in contrast with previous methods, our approach allows for efficiently computing diverse CFEs with both distance guarantees and perfect coverage.
翻译:反事实解释(CFE)正在被广泛用于解释算法决定,特别是在相应的决策背景下(例如贷款批准或审前保释);在这方面,CFES旨在向受算法决定影响的个人提供与个人(即最近的个人)最相似的结果;然而,尽管越来越多的工作提议计算CFES的算法,但这种方法要么缺乏最优化的距离(即不返回最近的个人)和完美的覆盖面(即不为所有个人提供CFE);或者他们无法处理复杂的模型,例如神经网络;在这项工作中,我们提供了一个基于混合-Interprogration(MIP)的框架,用可行的保证和与梯度方法相仿的运行时间来计算最接近的反事实解释。我们在成人、COMAS和信用数据集方面的实验表明,与以往的方法不同,我们的方法可以有效地计算各种不同的CFES,同时提供远程保障和完美的覆盖范围。