Recent work on algorithmic fairness has largely focused on the fairness of discrete decisions, or classifications. While such decisions are often based on risk score models, the fairness of the risk models themselves has received considerably less attention. Risk models are of interest for a number of reasons, including the fact that they communicate uncertainty about the potential outcomes to users, thus representing a way to enable meaningful human oversight. Here, we address fairness desiderata for risk score models. We identify the provision of similar epistemic value to different groups as a key desideratum for risk score fairness. Further, we address how to assess the fairness of risk score models quantitatively, including a discussion of metric choices and meaningful statistical comparisons between groups. In this context, we also introduce a novel calibration error metric that is less biased by sample size than previously proposed metrics, enabling meaningful comparisons between groups of different sizes. We illustrate our methodology, which is widely applicable in many other settings, in two case studies: one in recidivism risk prediction and one in prediction of major depressive disorder (MDD) risk.
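The abstract does not spell out the novel calibration error metric itself, so as background the sketch below shows a standard binned expected calibration error (ECE) computed separately per group, which is the kind of group-wise calibration comparison the paper discusses. The binning scheme, bin count, and synthetic data are illustrative assumptions, not the authors' method; note that this standard estimator is exactly the type whose sample-size bias the abstract refers to.

```python
import numpy as np

def expected_calibration_error(y_true, y_prob, n_bins=10):
    """Standard binned ECE: bin-frequency-weighted gap between mean
    predicted risk and observed outcome rate (illustrative only)."""
    y_true = np.asarray(y_true, dtype=float)
    y_prob = np.asarray(y_prob, dtype=float)
    bins = np.linspace(0.0, 1.0, n_bins + 1)
    # Assign each prediction to an equal-width probability bin.
    bin_ids = np.clip(np.digitize(y_prob, bins[1:-1], right=True), 0, n_bins - 1)
    ece = 0.0
    for b in range(n_bins):
        mask = bin_ids == b
        if not mask.any():
            continue
        # Calibration gap in this bin, weighted by the fraction of samples in it.
        gap = abs(y_prob[mask].mean() - y_true[mask].mean())
        ece += mask.mean() * gap
    return ece

# Hypothetical group-wise comparison on synthetic data:
# group "B" is given systematically miscalibrated risk scores.
rng = np.random.default_rng(0)
for group, miscal in [("A", 0.00), ("B", 0.10)]:
    p = rng.uniform(0.05, 0.95, size=2000)          # predicted risk scores
    y = rng.binomial(1, np.clip(p + miscal, 0, 1))  # observed binary outcomes
    print(f"group {group}: ECE = {expected_calibration_error(y, p):.3f}")
```

Because the binned gaps are absolute values, estimates from small groups tend to be inflated relative to large groups, which is why a less sample-size-biased metric is needed for fair cross-group comparisons.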