We present a new method for counterfactual explanations (CFEs) based on Bayesian optimisation that applies to both classification and regression models. Our method is a globally convergent search algorithm with support for arbitrary regression models and constraints like feature sparsity and actionable recourse, and furthermore can answer multiple counterfactual questions in parallel while learning from previous queries. We formulate CFE search for regression models in a rigorous mathematical framework using differentiable potentials, which resolves robustness issues in threshold-based objectives. We prove that in this framework, (a) verifying the existence of counterfactuals is NP-complete; and (b) that finding instances using such potentials is CLS-complete. We describe a unified algorithm for CFEs using a specialised acquisition function that composes both expected improvement and an exponential-polynomial (EP) family with desirable properties. Our evaluation on real-world benchmark domains demonstrate high sample-efficiency and precision.
翻译:我们提出了一种基于巴伊西亚优化的反事实解释新方法,该方法适用于分类和回归模型。我们的方法是一种全球趋同搜索算法,支持任意回归模型和特质宽度和可采取行动的追索等限制因素,还可以同时回答多重反事实问题,同时从以前的查询中学习。我们利用不同的潜力在严格的数学框架内制定回归模型,解决基于临界目标的稳健性问题。我们证明,在这个框架内,(a) 核实反事实的存在是NP-完整的;以及(b) 发现使用这种潜力的例子是CLS-完整的。我们描述了使用专门购置功能的CFES统一算法,它既构成预期的改进,又构成具有理想特性的指数-极化(EP)家庭。我们对现实世界基准领域的评估表明高采样效率和精确度。