In this paper we suggest NICE: a new algorithm to generate counterfactual explanations for heterogeneous tabular data. The design of our algorithm specifically takes into account algorithmic requirements that often emerge in real-life deployments: (1) the ability to provide an explanation for all predictions, (2) being able to handle any classification model (also non-differentiable ones), and (3) being efficient in run time. More specifically, our approach exploits information from a nearest unlike neighbour to speed up the search process, by iteratively introducing feature values from this neighbour in the instance to be explained. We propose four versions of NICE, one without optimization and, three which optimize the explanations for one of the following properties: sparsity, proximity or plausibility. An extensive empirical comparison on 40 datasets shows that our algorithm outperforms the current state-of-the-art in terms of these criteria. Our analyses show a trade-off between on the one hand plausibility and on the other hand proximity or sparsity, with our different optimization methods offering users the choice to select the types of counterfactuals that they prefer. An open-source implementation of NICE can be found at https://github.com/ADMAntwerp/NICE.
翻译:在本文中,我们建议 NICE : 一种新的算法,为多种列表数据提供反事实解释。我们的算法设计特别考虑到现实生活中经常出现的算法要求:(1) 能够解释所有预测,(2) 能够处理任何分类模式(也是非区别的模型),(3) 运行时效率高。更具体地说,我们的方法利用来自近邻的远邻的信息加快搜索过程,在解释时反复引入来自这个邻国的特征值。我们提出了四种版本 NICE,一个没有优化,三个,优化了以下属性之一的解释:(1) 简单性、近似性或合理性。对40套数据集的广泛实验性比较表明,我们的算法在这些标准方面超越了目前的最新状态。我们的分析表明,在一方面是可信赖性,另一方面是亲近性或偏狭性,我们不同的最优化方法为用户选择选择了他们喜欢的反事实类型。NICE/ANDIS/ADRMER的开放源实施方式可以在 httpAD/ADGIMS/ADRMER.