Models with dimension more than the available sample size are now commonly used in various applications. A sensible inference is possible using a lower-dimensional structure. In regression problems with a large number of predictors, the model is often assumed to be sparse, with only a few predictors active. Interdependence between a large number of variables is succinctly described by a graphical model, where variables are represented by nodes on a graph and an edge between two nodes is used to indicate their conditional dependence given other variables. Many procedures for making inferences in the high-dimensional setting, typically using penalty functions to induce sparsity in the solution obtained by minimizing a loss function, were developed. Bayesian methods have been proposed for such problems more recently, where the prior takes care of the sparsity structure. These methods have the natural ability to also automatically quantify the uncertainty of the inference through the posterior distribution. Theoretical studies of Bayesian procedures in high-dimension have been carried out recently. Questions that arise are, whether the posterior distribution contracts near the true value of the parameter at the minimax optimal rate, whether the correct lower-dimensional structure is discovered with high posterior probability, and whether a credible region has adequate frequentist coverage. In this paper, we review these properties of Bayesian and related methods for several high-dimensional models such as many normal means problem, linear regression, generalized linear models, Gaussian and non-Gaussian graphical models. Effective computational approaches are also discussed.
翻译:现在,在各种应用中,通常使用比现有抽样规模多得多的模型。 使用低维结构,可以作出合理的推论。 在大量预测器的回归问题中,模型通常被假定为稀疏,只有少数预测器活跃。 大量变量之间的相互依存以图形模型简单描述,其中变量在图形上的节点和两个节点之间的边缘代表了图表上的节点,根据其他变量,对高维环境中的有条件依赖性进行了理论研究。 在高维环境中进行推论的许多程序,通常使用惩罚功能诱发通过尽量减少损失功能获得的解决方案的偏移。在大量预测器的回归问题中,开发了典型的处罚功能。在较近于通过尽可能减少损失函数获得的参数的精确度问题上,巴伊西亚方法被建议了更近一些的方法,而先前的预测器则照顾了宽度结构。这些方法具有自然能力,也可以自动量化通过海平面分布的推断的不确定性。 近期对巴耶斯程序进行了理论研究,根据其他变量进行了很多的理论性研究。 出现的问题是, 后方位分配合同是否接近于最小型最佳比率的参数的真正值, 是否是高平面的直径直径分析, 是否是高平面结构结构结构结构结构结构,是否是高的。