Clinical prediction models are developed widely across medical disciplines. When predictors in such models are highly collinear, unexpected or spurious predictor-outcome associations may occur, thereby potentially reducing face-validity and explainability of the prediction model. Collinearity can be dealt with by exclusion of collinear predictors, but when there is no a priori motivation (besides collinearity) to include or exclude specific predictors, such an approach is arbitrary and possibly inappropriate. We compare different methods to address collinearity, including shrinkage, dimensionality reduction, and constrained optimization. The effectiveness of these methods is illustrated via simulations. In the conducted simulations, no effect of collinearity was observed on predictive outcomes. However, a negative effect of collinearity on the stability of predictor selection was found, affecting all compared methods, but in particular methods that perform strong predictor selection (e.g., Lasso).}
翻译:临床预测模型在医疗学科之间得到广泛开发,当这种模型中的预测数据高度相近、意外或虚假的预测结果可能发生时,可能会降低预测模型的面性,并有可能降低预测模型的可解释性; 可将对线预测数据排除在外,可以解决团结性问题,但当没有先验动机(除相近性外)将特定预测数据包括在内或排除在外时,这种做法是任意的,可能不适当; 我们比较了不同方法,以解决共线性,包括缩水、维度减少和限制优化。 这些方法的有效性通过模拟加以说明。 在进行模拟时,没有观察到对预测结果的共线效应。 然而,发现对预测结果的稳定性的共线性产生了负面影响,影响到所有比较方法,但特别影响到进行强有力预测选择的方法(例如,Lasso)。 }