Algorithmic Fairness is an established area of machine learning, willing to reduce the influence of biases in the data. Yet, despite its wide range of applications, very few works consider the multi-class classification setting from the fairness perspective. We address this question by extending the definition of Demographic Parity to the multi-class problem while specifying the corresponding expression of the optimal fair classifier. This suggests a plug-in data-driven procedure, for which we establish theoretical guarantees. Specifically, we show that the enhanced estimator mimics the behavior of the optimal rule, both in terms of fairness and risk. Notably, fairness guarantee is distribution-free. We illustrate numerically the quality of our algorithm. The procedure reveals to be much more suitable than an alternative approach enforcing fairness constraints on the score associated to each class. This shows that our method is empirically very effective in fair decision making on both synthetic and real datasets.
翻译:算法公平是一个公认的机器学习领域,愿意减少数据偏见的影响。然而,尽管应用范围很广,但很少有工作从公平角度考虑多级分类设置。我们通过将人口均等的定义扩大到多级问题,同时指定最佳公平分类者的相应表达方式来解决这一问题。这显示了一个插头数据驱动程序,为此我们建立了理论保障。具体地说,我们表明,强化的估算器模仿了最佳规则的行为,既包括公平又包括风险。值得注意的是,公平保障是无分配的。我们用数字来说明我们的算法质量。这个程序表明,比对每个类相关分数实行公平限制的替代方法更合适。这显示,我们的方法在对合成和真实数据集作出公平决策时,经验上非常有效。