Background: Predicted probabilities from a risk prediction model are inevitably uncertain. This uncertainty has mostly been studied from a statistical perspective. We apply Value of Information methodology to evaluate the decision-theoretic implications of prediction uncertainty. Methods: Adopting a Bayesian perspective, we extend the definition of the Expected Value of Perfect Information (EVPI) from decision analysis to net benefit calculations in risk prediction. EVPI is the expected gain in net benefit by using the correct predictions as opposed to predictions from a proposed model. We suggest bootstrap methods for sampling from the posterior distribution of predictions for EVPI calculation using Monte Carlo simulations. In a case study, we used subsets of data of various sizes from a clinical trial for predicting mortality after myocardial infarction to show how EVPI can be interpreted and how it changes with sample size. Results: With a sample size of 1,000, EVPI was 0 at threshold values larger than 0.6, indicating there is no point in procuring more development data for such thresholds. At thresholds of 0.4-0.6, the proposed model was not net beneficial, but EVPI was positive, indicating that obtaining more development data might be justified. Across all thresholds, the gain in net benefit by using the correct model was 24% higher than the gain by using the proposed model. EVPI declined with larger samples and was generally low with sample sizes of 4,000 or greater. We summarize an algorithm for incorporating EVPI calculations into the commonly used bootstrap method for optimism correction. Conclusion: Value of Information methods can be applied to explore decision-theoretic consequences of uncertainty in risk prediction, and can complement inferential methods when developing or validating risk prediction models.
翻译:背景:风险预测模型的预测概率必然不确定。这种不确定性大多是从统计角度研究的。我们采用信息价值方法来评估预测不确定性的决策理论影响。我们采用信息价值方法来评估预测不确定性的决策理论影响。方法:采用巴伊西亚视角,我们从决策分析中将完美信息的预期值定义扩展至风险预测中的净效益计算。EVPI是使用正确预测而不是拟议模型预测的预测而预期净收益。我们建议了从利用蒙特卡洛模拟计算 EVPI计算预测的事后分布中取样的靴套方法。在一项案例研究中,我们使用临床试验中不同大小的数据子子子来预测心肌梗肿后的死亡率。方法:从决策角度,我们将EVPI的预期值定义扩展到风险预测的预期值净值上,预期EVPI的预测值净收益上,预计使用正确的数值值值值值值为0.6,表明没有为此类阈值获得更多的发展数据。在0.4-0.6的阈值上,拟议模型是没有净应用的,但EVPI的数值值是各种数值的数值组数组数组数组数组数,在计算中,在使用更高的数值上,在使用模型中可以使用更高的数值计算中,在计算中,在计算中,在使用更高的值值中,在计算中可以得出中采用更高的值值值值值值中,因此,在使用更值值中,因此,在计算。