In domains such as criminal justice, medicine, and social welfare, decision makers increasingly have access to algorithmic Risk Assessment Instruments (RAIs). RAIs estimate the risk of an adverse outcome such as recidivism or child neglect, potentially informing high-stakes decisions such as whether to release a defendant on bail or initiate a child welfare investigation. It is important to ensure that RAIs are fair, so that the benefits and harms of such decisions are equitably distributed. The most widely used algorithmic fairness criteria are formulated with respect to observable outcomes, such as whether a person actually recidivates, but these criteria are misleading when applied to RAIs. Since RAIs are intended to inform interventions that can reduce risk, the prediction itself affects the downstream outcome. Recent work has argued that fairness criteria for RAIs should instead utilize potential outcomes, i.e. the outcomes that would occur in the absence of an appropriate intervention. However, no methods currently exist to satisfy such fairness criteria. In this paper, we target one such criterion, counterfactual equalized odds. We develop a post-processed predictor that is estimated via doubly robust estimators, extending and adapting previous post-processing approaches to the counterfactual setting. We also provide doubly robust estimators of the risk and fairness properties of arbitrary fixed post-processed predictors. Our predictor converges to an optimal fair predictor at fast rates. We illustrate properties of our method and show that it performs well on both simulated and real data.
翻译:在刑事司法、医学和社会福利等领域,决策者越来越多地有机会利用算法风险评估工具(RAI)。RAI估计了累犯或忽视儿童等不利结果的风险,这有可能为作出重大决定提供依据,例如是否保释被告或启动儿童福利调查。重要的是确保RAIs是公平的,以便公平地分配这类决定的好处和危害。最广泛使用的算法公平标准是在可观察结果方面制定的,例如一个人是否实际上重新恢复了算法风险评估工具,但在对RAIs适用这些标准时,这些标准具有误导性。由于RAIs的目的是为能够减少风险的干预措施提供信息,预测本身就影响到下游结果。最近的工作表明,RAIs的公平标准应该利用潜在结果,即在没有适当干预的情况下可能产生的结果。然而,目前没有方法可以满足这种公平标准。在本文中,我们针对一个这样的标准,即反现实的相同概率。我们开发了一个后处理的预测器,通过加倍有力的估测算器来估算,我们之前的稳健的准确性预测方法也提供了我们之前的准确性预测方法,我们之前的精确的精确性预测率和精确的精确的预测结果。