We consider post-selection inference for high-dimensional (generalized) linear models. Data carving (Fithian et al., 2014) is a promising technique to perform this task. However, it suffers from the instability of the model selector and hence, may lead to poor replicability, especially in high-dimensional settings. We propose the multicarve method inspired by multisplitting to improve upon stability and replicability. Furthermore, we extend existing concepts to group inference and illustrate the applicability of the methodology also for generalized linear models.
翻译:我们认为高维(通用)线性模型的选后推论是高维(通用)线性模型,数据刻录(Fithian等人,2014年)是完成这项任务的一个很有希望的方法,但模型选择器不稳定,因此可能导致复制能力差,特别是在高维环境中。我们提出多分法,以改善稳定性和可复制性。此外,我们将现有概念扩展至分组推论,并表明该方法也适用于通用线性模型。