Background: Due to the finite size of the development sample, predicted probabilities from a risk prediction model are inevitably uncertain. We apply Value of Information methodology to evaluate the decision-theoretic implications of prediction uncertainty. Methods: Adopting a Bayesian perspective, we extend the definition of the Expected Value of Perfect Information (EVPI) from decision analysis to net benefit calculations in risk prediction. In the context of model development, EVPI is the expected gain in net benefit by using the correct predictions as opposed to predictions from a proposed model. We suggest bootstrap methods for sampling from the posterior distribution of predictions for EVPI calculation using Monte Carlo simulations. In a case study, we used subsets of data of various sizes from a clinical trial for predicting mortality after myocardial infarction to show how EVPI changes with sample size. Results: With a sample size of 1,000 and at the pre-specified threshold of 2% on predicted risks, the gain in net benefit by using the proposed and the correct models were 0.0006 and 0.0011, respectively, resulting in an EVPI of 0.0005 and a relative EVPI of 87%. EVPI was zero only at unrealistically high thresholds (>85%). As expected, EVPI declined with larger samples. We summarize an algorithm for incorporating EVPI calculations into the commonly used bootstrap method for optimism correction. Conclusion: Value of Information methods can be applied to explore decision-theoretic consequences of uncertainty in risk prediction and can complement inferential methods when developing risk prediction models. R code for implementing this method is provided.
翻译:背景:由于开发样本的有限规模,风险预测模型的预测概率必然不确定。我们采用信息价值方法来评价预测不确定性的决策理论影响。方法:采用巴伊西亚视角,我们从决策分析扩展完美信息的预期值的定义,到风险预测中的净效益计算。在模型开发方面,EVPI是预期通过使用正确预测而非拟议模型的预测而获得的净收益。我们建议采用“信息价值”方法,从利用蒙特卡洛模拟进行EVPI计算预测的事后分发的不确定性预测中进行取样。在一项案例研究中,我们使用临床试验中不同大小的数据组子子来预测心肌梗肿后的死亡率。我们把EVPI的预期值定义从决策值从决策值从决策值从1 000增加到预测值前的2%,而使用拟议和正确模型的回报率则分别为0.0006和0.0011,结果只有05和VEPI的预测值预测值,因此只有将EVPI的预测值从临床测试值分析值到88%的预测值方法。