We study a mean-field spike and slab variational Bayes (VB) approximation to Bayesian model selection priors in sparse high-dimensional linear regression. Under compatibility conditions on the design matrix, oracle inequalities are derived for the mean-field VB approximation, implying that it converges to the sparse truth at the optimal rate and gives optimal prediction of the response vector. The empirical performance of our algorithm is studied, showing that it works comparably well as other state-of-the-art Bayesian variable selection methods. We also numerically demonstrate that the widely used coordinate-ascent variational inference (CAVI) algorithm can be highly sensitive to the parameter updating order, leading to potentially poor performance. To mitigate this, we propose a novel prioritized updating scheme that uses a data-driven updating order and performs better in simulations. The variational algorithm is implemented in the R package 'sparsevb'.
翻译:在设计矩阵的兼容性条件下,在平均VB近似值中产生甲骨骼不平等,这意味着它会以最佳速率与稀疏的真理趋同,并对反应矢量作出最佳预测。正在研究我们的算法的经验性表现,表明它与其他最先进的Bayesian变量选择方法相匹配。我们还从数字上表明,广泛使用的协调性易变推法(CAVI)算法对参数更新顺序可能非常敏感,导致可能的性能不佳。为了减轻这一点,我们提议了一个新的优先更新计划,使用数据驱动的更新顺序,并在模拟中更好地表现。变异算法在R 软件包“sparsevb”中实施。