Counterfactual fairness alleviates the discrimination between the model prediction toward an individual in the actual world (observational data) and that in counterfactual world (i.e., what if the individual belongs to other sensitive groups). The existing studies need to pre-define the structural causal model that captures the correlations among variables for counterfactual inference; however, the underlying causal model is usually unknown and difficult to be validated in real-world scenarios. Moreover, the misspecification of the causal model potentially leads to poor performance in model prediction and thus makes unfair decisions. In this research, we propose a novel minimax game-theoretic model for counterfactual fairness that can produce accurate results meanwhile achieve a counterfactually fair decision with the relaxation of strong assumptions of structural causal models. In addition, we also theoretically prove the error bound of the proposed minimax model. Empirical experiments on multiple real-world datasets illustrate our superior performance in both accuracy and fairness. Source code is available at \url{https://github.com/tridungduong16/counterfactual_fairness_game_theoretic}.
翻译:不完善的结构因果模型的对抗事实公平性可以缓解模型在实际世界(观测数据)和反事实世界(即如果个体属于其他敏感群体)中的个体预测之间的歧视。现有的研究需要预定义捕捉变量之间相关性的结构因果模型,以进行反事实推断。然而,潜在的因果模型通常是未知的,在现实场景中很难验证。此外,因果模型的错误规定可能导致模型预测性能差,从而做出不公平的决策。在这项研究中,我们提出了一种能够通过博弈论模型实现对抗事实公平的缺点结构因果模型,能够在没有结构因果模型的强假设的情况下产生准确的结果。此外,我们还在理论上证明了所提出的极小极大模型的误差界。对多个真实世界数据集的实证实验显示出了我们在准确度和公平方面的优越性。源代码可在\url{https://github.com/tridungduong16/counterfactual_fairness_game_theoretic}找到。