Scoring systems, as simple classification models, have significant advantages in interpretability and transparency when making predictions. They facilitate humans' decision-making by allowing them to make a quick prediction by hand through adding and subtracting a few point scores and thus have been widely used in various fields such as medical diagnosis of Intensive Care Units. However, (un)fairness issues in these models have long been criticized, and the use of biased data in the construction of score systems heightens this concern. In this paper, we propose a general framework to create data-driven fairness-aware scoring systems. Our approach is first to develop a social welfare function that incorporates both efficiency and equity. Then, we translate the social welfare maximization problem in economics into the empirical risk minimization task of the machine learning community to derive a fairness-aware scoring system with the help of mixed integer programming. We show that the proposed framework provides practitioners or policymakers great flexibility to select their desired fairness requirements and also allows them to customize their own requirements by imposing various operational constraints. Experimental evidence on several real data sets verifies that the proposed scoring system can achieve the optimal welfare of stakeholders and balance the interpretability, fairness, and efficiency issues.
翻译:分类系统作为简单的分类模型,在作出预测时具有解释性和透明度方面的重大优势,有助于人类决策,通过增加和减去几个点分数,通过手工快速预测,从而在诸如对集中护理单位的医疗诊断等各个领域广泛使用,然而,长期以来,这些模型中的不公平问题一直受到批评,在建立得分系统时使用偏差数据加剧了这种关切。在本文件中,我们提出了一个创建数据驱动的公平性评分系统的一般框架。我们的方法首先是发展一种社会福利功能,既包括效率又包括公平。然后,我们把经济领域的社会福利最大化问题转化为机构学习社区的经验风险最小化任务,以便利用混合整数规划来形成公平性的评分系统。我们表明,拟议的框架为从业者或决策者提供了选择所希望的公平性要求的巨大灵活性,并使他们能够通过施加各种操作限制来定制自己的要求。关于若干实际数据集的实验性证据证实,拟议的评分系统能够实现利益攸关方的最佳福利,平衡解释性、公正性、有效性和效率问题。