Statistical risk assessments inform consequential decisions such as pretrial release in criminal justice, and loan approvals in consumer finance. Such risk assessments make counterfactual predictions, predicting the likelihood of an outcome under a proposed decision (e.g., what would happen if we approved this loan?). A central challenge, however, is that there may have been unmeasured confounders that jointly affected past decisions and outcomes in the historical data. This paper proposes a tractable mean outcome sensitivity model that bounds the extent to which unmeasured confounders could affect outcomes on average. The mean outcome sensitivity model partially identifies the conditional likelihood of the outcome under the proposed decision, popular predictive performance metrics (e.g., accuracy, calibration, TPR, FPR), and commonly-used predictive disparities. We derive their sharp identified sets, and we then solve three tasks that are essential to deploying statistical risk assessments in high-stakes settings. First, we propose a doubly-robust learning procedure for the bounds on the conditional likelihood of the outcome under the proposed decision. Second, we translate our estimated bounds on the conditional likelihood of the outcome under the proposed decision into a robust, plug-in decision-making policy. Third, we develop doubly-robust estimators of the bounds on the predictive performance of an existing risk assessment.
翻译:统计风险评估为刑事司法中的审前释放和消费融资中的贷款批准等相应决定提供了依据。这类风险评估作出了反事实预测,预测了根据拟议决定(例如,如果我们批准这一贷款将会发生什么??)产生结果的可能性。然而,一个中心挑战是,可能存在一些非计量的混乱者,共同影响历史数据中过去的决定和结果。本文件提出了一个可移动的、潜在的结果敏感性模型,该模型将非计量的混乱者对平均结果影响的程度加以限制。平均结果敏感性模型部分确定了拟议决定下的结果的有条件可能性、流行的预测性业绩衡量标准(例如,准确性、校准、TRPR、FPR)和常用的预测性差异。我们从中找出了它们所查明的明细的组合,然后我们解决了三个对于在高比例环境中部署统计风险评估至关重要的任务。首先,我们提出了对拟议决定下的结果的有条件可能性进行加倍的学习程序。第二,我们将我们对根据拟议的预测结果的有条件可能性的估计界限(例如,精确性、精确性、精确性、预测性),我们把目前对作出预测的预测性决定的有条件结果的界限转换成一个稳健的上限。