We consider methods for transporting a prediction model and assessing its performance for use in a new target population, when outcome and covariate information for model development is available from a simple random sample from the source population, but only covariate information is available on a simple random sample from the target population. We discuss how to tailor the prediction model for use in the target population, how to assess model performance in the target population (e.g., by estimating the target population mean squared error), and how to perform model and tuning parameter selection in the context of the target population. We provide identifiability results for the target population mean squared error of a potentially misspecified prediction model under a sampling design where the source study and the target population samples are obtained separately. We also introduce the concept of prediction error modifiers that can be used to reason about the need for tailoring measures of model performance to the target population and provide an illustration of the methods using simulated data.
翻译:我们考虑如何运输预测模型并评估其性能,供新的目标人群使用,如果从来源人群的简单随机抽样中可以获得用于模型开发的结果和共变信息,但从目标人群的简单随机抽样中只能获得共变信息,我们讨论如何调整预测模型,供目标人群使用,如何评估目标人群的模型性能(例如,通过估计目标人群平均平方误差),以及如何在目标人群中执行模型和调整参数选择。我们提供目标人群的可识别性结果,指的是在抽样设计中可能错误地指定预测模型的正方形错误,在抽样设计中分别获取源研究和目标人群样本。我们还介绍了预测误差改变者的概念,以便说明需要根据目标人群对模型性能进行量度,并举例说明使用模拟数据的方法。