Linear mixed models (LMMs) are suitable for clustered data and are common in biometrics, medicine, survey statistics and many other fields. In those applications it is essential to carry out a valid inference after selecting a subset of the available variables. We construct confidence sets for the fixed effects in Gaussian LMMs that are based on Lasso-type estimators. Aside from providing confidence regions, this also allows to quantify the joint uncertainty of both variable selection and parameter estimation in the procedure. To show that the resulting confidence sets for the fixed effects are uniformly valid over the parameter spaces of both the regression coefficients and the covariance parameters, we also prove the novel result on uniform Cramer consistency of the restricted maximum likelihood (REML) estimators of the covariance parameters. The superiority of the constructed confidence sets to naive post-selection procedures is validated in simulations and illustrated with a study of the acid neutralization capacity of lakes in the United States.
翻译:线性混合模型(LMMs)适合于集群数据,在生物鉴别学、医学、调查统计和许多其他领域都很常见,在这些应用中,在选择可用变量的子集后,必须进行有效的推断。我们根据激光测算仪为高西亚LMMs的固定效果建立信任套件。除了提供信任区域外,还允许量化程序中变量选择和参数估计的共同不确定性。为了表明由此产生的固定效应的置信套件对回归系数和共变参数的参数空间具有统一的有效性,我们还证明共变参数限最大概率的测算器统一一致的新结果。已建信任套件优于天真的选后程序,在模拟中加以验证,并通过对美国湖泊酸中性能力的研究加以说明。