Decision makers increasingly rely on algorithmic risk scores to determine access to binary treatments including bail, loans, and medical interventions. In these settings, we reconcile two fairness criteria that were previously shown to be in conflict: calibration and error rate equality. In particular, we derive necessary and sufficient conditions for the existence of calibrated scores that yield classifications achieving equal error rates at any given group-blind threshold. We then present an algorithm that searches for the most accurate score subject to both calibration and minimal error rate disparity. Applied to the COMPAS criminal risk assessment tool, we show that our method can eliminate error disparities while maintaining calibration. In a separate application to credit lending, we compare our procedure to the omission of sensitive features and show that it raises both profit and the probability that creditworthy individuals receive loans.
翻译:决策者越来越依赖算法风险分数来确定获得包括保释、贷款和医疗干预在内的二元治疗的机会。在这些环境中,我们调和了先前显示有冲突的两个公平标准:校准和误差率平等。特别是,我们为存在经校准的分数创造必要和充分的条件,这些分数可以得出在任何特定群体-盲点阈值上达到相同误差率的校准分数。然后我们提出一种算法,在校准和最低误差率差异下搜索最准确的分数。我们应用到 COMPAS 犯罪风险评估工具,我们证明我们的方法可以消除误差差异,同时保持校准。在单独应用信贷贷款时,我们将我们的程序与敏感特征的遗漏进行比较,并表明它既提高了利润,也提高了信誉良好的个人获得贷款的可能性。