Linear mixed models are a versatile statistical tool to study data by accounting for fixed effects and random effects from multiple sources of variability. In many situations, a large number of candidate fixed effects is available and it is of interest to select a parsimonious subset of those being effectively relevant for predicting the response variable. Variational approximations facilitate fast approximate Bayesian inference for the parameters of a variety of statistical models, including linear mixed models. However, for models having a high number of fixed or random effects, simple application of standard variational inference principles does not lead to fast approximate inference algorithms, due to the size of model design matrices and inefficient treatment of sparse matrix problems arising from the required approximating density parameters updates. We illustrate how recently developed streamlined variational inference procedures can be generalized to make fast and accurate inference for the parameters of linear mixed models with nested random effects and global-local priors for Bayesian fixed effects selection. Our variational inference algorithms achieve convergence to the same optima of their standard implementations, although with significantly lower computational effort, memory usage and time, especially for large numbers of random effects. Using simulated and real data examples, we assess the quality of automated procedures for fixed effects selection that are free from hyperparameters tuning and only rely upon variational posterior approximations. Moreover, we show high accuracy of variational approximations against model fitting via Markov Chain Monte Carlo sampling.
翻译:线性混合模型是一个多用途统计工具,用于通过计算固定效应和来自多种变异来源的随机效应来研究数据;在许多情况下,由于模型设计矩阵规模小,对所需近似密度参数更新引起的分散的矩阵问题处理效率低,因此有大量可供选择的固定效应和来自多种变异来源的随机效应,有兴趣选择对预测响应变量具有有效相关性的偏差子组;变式近似有助于快速近似巴伊西亚对各种统计模型参数的推断,包括线性混合模型;但是,对于固定效应或随机效应较多的模型,简单应用标准变异推断原则不会导致快速近似推理算法,因为模型设计矩阵规模小,对所需近似密度密度参数更新引起的分散矩阵问题处理效率低;我们说明最近制定简化的变异性计算程序如何普遍化,以便快速准确地推导出具有巢性随机效应的线性混合模型参数,以及选择巴伊西固定效应模型的全球地方前科。 我们的变率算法不会导致其标准实施情况的一致,尽管其计算努力、记忆使用率和时间处理效率低得多的问题,特别是从我们测算结果的精确度模型的精确度模型上,我们只能评估。