统一公平分类的最佳分数转换 (Optimized Score Transformation for Consistent Fair Classification)

from arxiv, 78 pages, 16 figures. Published in Journal of Machine Learning Research. Earlier version published at the 2020 International Conference on Artificial Intelligence and Statistics (AISTATS)

This paper considers fair probabilistic binary classification where the outputs of primary interest are predicted probabilities, commonly referred to as scores. We formulate the problem of transforming scores to satisfy fairness constraints that are linear in conditional means of scores while minimizing a cross-entropy objective. The formulation can be applied directly to post-process classifier outputs and we also explore a pre-processing extension, thus allowing maximum freedom in selecting a classification algorithm. We derive a closed-form expression for the optimal transformed scores and a convex optimization problem for the transformation parameters. In the population limit, the transformed score function is the fairness-constrained minimizer of cross-entropy with respect to the true conditional probability of the outcome. In the finite sample setting, we propose a method called FairScoreTransformer to approach this solution using a combination of standard probabilistic classifiers and ADMM. We provide several consistency and finite-sample guarantees for FairScoreTransformer, relating to the transformation parameters and transformed score function that it obtains. Comprehensive experiments comparing to 10 existing methods show that FairScoreTransformer has advantages for score-based metrics such as Brier score and AUC while remaining competitive for binary label-based metrics such as accuracy.

翻译：本文考虑了一种公平的概率二进制分类,其中主要利益产出是预测概率的,通常称为分数。我们提出了转换分数的问题,以满足在有条件的得分手段中线性的公平限制,同时尽量减少交叉消耗目标。这种提法可以直接适用于后处理分类产出,我们还探讨了一个预处理扩展,从而允许在选择分类算法方面享有最大程度的自由。我们为最优转换分得出一种封闭式表达式表达式,为转换参数找出一个曲线优化优化优化优化优化问题。在人口限制方面,改变分数的功能是公平限制地尽量减少对结果的真正有条件概率的交叉消耗。在有限的抽样设置中,我们提议一种称为公平-分数透明度的方法,以使用标准概率分类师和ADMM 的组合来接近这一解决办法。我们为公平-核心分数转换参数和转换参数的改变性分数功能提供了若干一致性和有限性保证。在对10种现有方法进行比较后,将显示公平-核心变分数在基于分数的衡量指标上具有优势,例如Brierb的评分和A-CRin-CRusteral标签。