高度正规回归估计值标准化的普遍性 (Universality of regularized regression estimators in high dimensions)

The Convex Gaussian Min-Max Theorem (CGMT) has emerged as a prominent theoretical tool for analyzing the precise stochastic behavior of various statistical estimators in the so-called high dimensional proportional regime, where the sample size and the signal dimension are of the same order. However, a well recognized limitation of the existing CGMT machinery rests in its stringent requirement on the exact Gaussianity of the design matrix, therefore rendering the obtained precise high dimensional asymptotics largely a specific Gaussian theory in various important statistical models. This paper provides a structural universality framework for a broad class of regularized regression estimators that is particularly compatible with the CGMT machinery. In particular, we show that with a good enough $\ell_\infty$ bound for the regression estimator $\hat{\mu}_A$, any `structural property' that can be detected via the CGMT for $\hat{\mu}_G$ (under a standard Gaussian design $G$) also holds for $\hat{\mu}_A$ under a general design $A$ with independent entries. As a proof of concept, we demonstrate our new universality framework in three key examples of regularized regression estimators: the Ridge, Lasso and regularized robust regression estimators, where new universality properties of risk asymptotics and/or distributions of regression estimators and other related quantities are proved. As a major statistical implication of the Lasso universality results, we validate inference procedures using the degrees-of-freedom adjusted debiased Lasso under general design and error distributions. We also provide a counterexample, showing that universality properties for regularized regression estimators do not extend to general isotropic designs.

翻译：Convex Gausian Min-Max Theorem (CGMT) 已成为一个突出的理论工具,用于分析所谓高维比例系统中各种统计估测员准确的随机行为,其样本大小和信号尺寸相同。然而,现有的CGMT机制的公认局限性在于其对设计矩阵精确度的严格要求,因此,获得的精确的高度性无症状在很大程度上是各种重要统计模型中一个特定的标准化标准标准度普遍性理论。本文提供了一个结构普遍性框架,用于分析与CGMT机制特别兼容的所谓高维比例系统中各种统计估测员的大规模定期回归估测度行为。我们特别表明,如果对回归估测仪的美元数量和信号尺寸有相当的美元约束,那么可以通过CMT为 $\hat_muçãG$提供的“结构属性”(根据标准高比值设计为美元),本文也为不固定的回归度估测值估测值估测值值值值值值值值的大规模回归估测值估算值提供了结构。