Algorithmic Fairness is an established area of machine learning, willing to reduce the influence of biases in the data. Yet, despite its wide range of applications, very few works consider the multi-class classification setting from the fairness perspective. We extend both definitions of exact and approximate fairness in the case of Demographic Parity to multi-class classification. We specify the corresponding expressions of the optimal fair classifiers. This suggests a plug-in data-driven procedure, for which we establish theoretical guarantees. The enhanced estimator is proved to mimic the behavior of the optimal rule both in terms of fairness and risk. Notably, fairness guarantees are distribution-free. The approach is evaluated on both synthetic and real datasets and turns out to be very effective in decision making with a preset level of unfairness. In addition, our method is competitive with the state-of-the-art in-processing fairlearn in the specific binary classification setting.
翻译:算法公平是一个公认的机器学习领域,愿意减少数据偏见的影响。然而,尽管其应用范围很广,但很少有工作从公平的角度考虑多级分类设置。我们将人口均等的准确和近似公平的定义扩展至多级分类。我们指定了最佳公平分类者的相应表达方式。这表示了一个插接数据驱动程序,为此我们建立了理论保障。经证明,强化的估算器在公平和风险两方面都模仿了最佳规则的行为。值得注意的是,公平保障是无分配的。这种方法在合成和真实的数据集上都进行了评价,并证明在决策中非常有效,而且具有预先设定的不公平程度。此外,我们的方法与特定二元分类环境中的处理交易中的最新工艺竞争。