Machine Learning models are increasingly used for decision making, in particular in high-stakes applications such as credit scoring, medicine or recidivism prediction. However, there are growing concerns about these models with respect to their lack of interpretability and the undesirable biases they can generate or reproduce. While the concepts of interpretability and fairness have been extensively studied by the scientific community in recent years, few works have tackled the general multi-class classification problem under fairness constraints, and none of them proposes to generate fair and interpretable models for multi-class classification. In this paper, we use Mixed-Integer Linear Programming (MILP) techniques to produce inherently interpretable scoring systems under sparsity and fairness constraints, for the general multi-class classification setup. Our work generalizes the SLIM (Supersparse Linear Integer Models) framework that was proposed by Rudin and Ustun to learn optimal scoring systems for binary classification. The use of MILP techniques allows for an easy integration of diverse operational constraints (such as, but not restricted to, fairness or sparsity), but also for the building of certifiably optimal models (or sub-optimal models with bounded optimality gap).
翻译:机器学习模型越来越被用于决策,特别是在高风险应用中,例如信用评分、医学或再犯预测。然而,人们越来越关心这些模型在可解释性方面的缺陷以及它们可能产生或再现的不良偏差。虽然解释性和公平性的概念近年来已经被科学界广泛研究,但很少有作品在公平性约束下研究一般的多类别分类问题,也没有一个提出为多类别分类生成公平且可解释的模型。本文使用混合整数线性规划(MILP)技巧,针对一般的多类别分类设置,提出一种在稀疏性和公平性约束下生成固有可解释评分系统的方法。我们的工作广泛化了Rudin和Ustun提出的用于学习二分类最优评分系统的SLIM(超稀疏线性整数模型)框架。MILP技术的使用可以轻松集成各种操作约束(例如公平性或稀疏性),也可以构建具有可证明最优性的模型(或具有有界最优性差距的次优模型)。