After selection with the Group LASSO (or generalized variants such as the overlapping, sparse, or standardized Group LASSO), inference for the selected parameters is unreliable in the absence of adjustments for selection bias. In the penalized Gaussian regression setup, existing approaches provide adjustments for selection events that can be expressed as linear inequalities in the data variables. Such a representation, however, fails to hold for selection with the Group LASSO and substantially obstructs the scope of subsequent post-selective inference. Key questions of inferential interest -- for example, inference for the effects of selected variables on the outcome -- remain unanswered. In the present paper, we develop a consistent, post-selective, Bayesian method to address the existing gaps by deriving a likelihood adjustment factor and an approximation thereof that eliminates bias from the selection of groups. Experiments on simulated data and data from the Human Connectome Project demonstrate that our method recovers the effects of parameters within the selected groups while paying only a small price for bias adjustment.
翻译:在与LASSO集团(或LASSO集团,如重叠、稀少或标准化的LASSO集团等通用变体)选定参数后,在没有对选择偏差进行调整的情况下,对选定参数的推断是不可靠的。在受处罚的Gaussian回归设置中,现有方法为选择事件提供了调整,可以表现为数据变量中的线性不平等。但是,这种表述未能与LASSO集团保持选择,并严重妨碍了随后的选择性后推论的范围。 推定利益的关键问题 -- -- 例如,对选定变量对结果的影响的推论 -- -- 仍然没有答案。在本文件中,我们制定了一种一致的、后选择性的巴伊斯方法,通过得出一个可能的调整系数及其近似性来弥补现有差距,从而消除了群体选择中的偏差。关于人类连接项目模拟数据和数据的实验表明,我们的方法恢复了选定组内参数的影响,同时只为偏差调整支付很小的价格。