Statistical risk assessments inform consequential decisions, such as pretrial release in criminal justice and loan approvals in consumer finance, by counterfactually predicting an outcome under a proposed decision (e.g., would the applicant default if we approved this loan?). There may, however, have been unmeasured confounders that jointly affected decisions and outcomes in the historical data. We propose a mean outcome sensitivity model that bounds the extent to which unmeasured confounders could affect outcomes on average. The mean outcome sensitivity model partially identifies the conditional likelihood of the outcome under the proposed decision, popular predictive performance metrics, and predictive disparities. We derive their identified sets and develop procedures for the confounding-robust learning and evaluation of statistical risk assessments. We propose a nonparametric regression procedure for the bounds on the conditional likelihood of the outcome under the proposed decision, and estimators for the bounds on predictive performance and disparities. Applying our methods to a real-world credit-scoring task from a large Australian financial institution, we show how varying assumptions on unmeasured confounding lead to substantive changes in the credit score's predictions and evaluations of its predictive disparities.
翻译:统计风险评估通过反事实预测拟议决定的结果(例如,如果我们批准这笔贷款,申请人会违约吗?),从而为相应的决定提供依据,例如刑事司法审前释放和消费融资贷款核准等。 然而,可能存在一些无法计量的混乱者,共同影响历史数据中的决定和结果。我们提出了一个潜在的结果敏感性模型,该模型将未计量的混乱者在多大程度上平均影响结果。平均结果敏感性模型部分地确定了拟议决定的结果的有条件可能性、大众预测性能指标和预测性差异。我们为这些已查明的数据集制定了程序,并制定了统计风险评估的混乱学习和评估程序。我们提出了关于拟议决定中的结果的有条件可能性的无参数回归程序,以及预测预测性业绩和差异的界限的估算者。将我们的方法应用于澳大利亚大型金融机构对真实世界信用的分辨任务,我们提出了如何不同不同的假设,未计量的假设导致信用分数预测的预测和对差距的预测的预测。