LASSO and SLOPE are two popular methods for dimensionality reduction in the high-dimensional regression. LASSO can eliminate redundant predictors by setting the corresponding regression coefficients to zero, while SLOPE can additionally identify clusters of variables with the same absolute values of regression coefficients. It is well known that LASSO Irrepresentability Condition is sufficient and necessary for the proper estimation of the sign of sufficiently large regression coefficients. In this article we formulate an analogous Irrepresentability Condition for SLOPE, which is sufficient and necessary for the proper identification of the SLOPE pattern, i.e. of the proper sign as well as of the proper ranking of the absolute values of individual regression coefficients, while proper ranking guarantees a proper clustering. We also provide asymptotic results on the strong consistency of pattern recovery by SLOPE when the number of columns in the design matrix is fixed while the sample size diverges to infinity.
翻译:LASSO和SLOPE是高维回归度降低维度的两个流行方法。LASSO可以通过将相应的回归系数设定为零来消除冗余预测,而SLOPE还可以进一步确定具有相同回归系数绝对值的变量组群。众所周知,LASSO的不可代表性条件对于适当估计足够大回归系数的标志是足够和必要的。在本条中,我们为SLOPE制定了类似的不可代表性条件,这对于正确确定 SLOPE 模式,即正确标志和单个回归系数绝对值的适当排序是足够和必要的,而适当的排序则保证了适当的组合。我们还提供了在设计矩阵中的柱数固定时,当样本大小偏离至无限时,SLOPE恢复模式的强烈一致性。