This paper presents a selective survey of recent developments in statistical inference and multiple testing for high-dimensional regression models, including linear and logistic regression. We examine the construction of confidence intervals and hypothesis tests for various low-dimensional objectives such as regression coefficients and linear and quadratic functionals. The key technique is to generate debiased and desparsified estimators for the targeted low-dimensional objectives and estimate their uncertainty. In addition to covering the motivations for and intuitions behind these statistical methods, we also discuss their optimality and adaptivity in the context of high-dimensional inference. In addition, we review the recent development of statistical inference based on multiple regression models and the advancement of large-scale multiple testing for high-dimensional regression. The R package SIHR has implemented some of the high-dimensional inference methods discussed in this paper.
翻译:本文件有选择地调查了高维回归模型统计推论和多重测试的最新发展情况,包括线性和后勤性回归。我们研究了各种低维目标,如回归系数、线性和二次功能等,建立信心间隔和假设测试的情况。关键技术是为目标低维目标产生分级和分解的估算器,并估计其不确定性。除了涵盖这些统计方法的动机和直觉外,我们还在高维推论的背景下讨论了这些方法的最佳性和适应性。此外,我们审视了基于多重回归模型的统计推论的最新发展情况,以及高维回归大规模多重测试的进展。SISHR一揽子方案还实施了本文讨论的一些高维推论方法。