In this paper we introduce a new model building algorithm called self-validating ensemble modeling or SVEM. The method enables the fitting of validated predictive models to the relatively small data sets typically generated from designed experiments where prediction is the desired outcome which is often the case in Quality by Design studies in bio-pharmaceutical industries. In order to fit validated predictive models, SVEM uses a unique weighting scheme applied to the responses and fractional weighted bootstrapping to generate a large ensemble of fitted models. The weighting scheme allows the original data to serve both as a training set and validation set. The method is very general in application and works with most model selection algorithms. Through extensive simulation studies and a case study we demonstrate that SVEM generates models with lower prediction error as compared to more traditional statistical approaches that are based on fitting a single model.
翻译:在本文中,我们引入了一种新的模型建设算法,称为自我验证混合模型或SVEM。这种方法使经过验证的预测模型能够与通常由设计实验产生的相对较小的数据集相适应,这些实验的预期结果通常为预期结果,在生物制药工业的设计研究的质量中,通常就是这种情况。为了与经过验证的预测模型相适应,SVEM使用了一种独特的加权法和分数加权制制导法,以产生大量的装配模型。加权制使原始数据既可以用作训练组,也可以用作鉴定组。这种方法在应用中非常普遍,并且与大多数模型选择算法合作。通过广泛的模拟研究和案例研究,我们证明SVEM生成了预测错误较低的模型,而这种预测错误则比基于安装单一模型的较传统的统计方法要低。