量回归的混合效果建模和变量选择 (Mixed Effect Modeling and Variable Selection for Quantile Regression)

It is known that the estimating equations for quantile regression (QR) can be solved using an EM algorithm in which the M-step is computed via weighted least squares, with weights computed at the E-step as the expectation of independent generalized inverse-Gaussian variables. This fact is exploited here to extend QR to allow for random effects in the linear predictor. Convergence of the algorithm in this setting is established by showing that it is a generalized alternating minimization (GAM) procedure. Another modification of the EM algorithm also allows us to adapt a recently proposed method for variable selection in mean regression models to the QR setting. Simulations show the resulting method significantly outperforms variable selection in QR models using the lasso penalty. Applications to real data include a frailty QR analysis of hospital stays, and variable selection for age at onset of lung cancer and for riboflavin production rate using high-dimensional gene expression arrays for prediction.

翻译：已知的是,量化回归(QR)的估计方程可以通过EM算法来解决,在EM算法中,以加权最小方位计算M级步数,在E级计算加权数,作为独立通用反倒数-Gausian变量的预期值。这一事实在这里被用来扩展QR,以允许线性预测器中的随机效果。此设置中算法的趋同是通过显示它是普遍交替最小化(GAM)程序确定的。对EM算法的另一项修改还使我们能够将最近提出的在平均回归模型中的变量选择方法调整到QR设置。模拟显示所产生的方法大大优于使用 lasso 罚款在QR 模型中的变量选择。对真实数据的应用包括对住院期的脆弱QR分析,以及使用高维基因表达阵列进行预测,对肺癌起始年龄和肋骨拉文生产率的变量选择。