On the sign recovery by LASSO, thresholded LASSO and thresholded Basis Pursuit Denoising - 专知 Papers

August 31, 2021

Patrick J. C. Tardivel, Malgorzata Bogdan

Basis Pursuit (BP), Basis Pursuit DeNoising (BPDN), and LASSO are popular methods for identifying important predictors in the high-dimensional linear regression model, i.e. when the number of rows of the design matrix X is smaller than the number of columns. By definition, BP uniquely recovers the vector of regression coefficients b if there is no noise and the vector b has the smallest L1 norm among all vectors s such that Xb = Xs (identifiability condition). Furthermore, LASSO can recover the sign of b only under a much stronger irrepresentability condition. Meanwhile, it is known that the model selection properties of LASSO can be improved by hard-thresholding its estimates. This article supports these findings by proving that thresholded LASSO, thresholded BPDN and thresholded BP recover the sign of b in both the noisy and noiseless cases if and only if b is identifiable and large enough. In particular, if X has iid Gaussian entries and the number of predictors grows linearly with the sample size, then these thresholded estimators can recover the sign of b when the signal sparsity is asymptotically below the Donoho-Tanner transition curve. This is in contrast to the regular LASSO, which asymptotically recovers the sign of b only when the signal sparsity tends to 0. Numerical experiments show that the identifiability condition, unlike the irrepresentability condition, does not seem to be affected by the structure of the correlations in the matrix X.
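
To make the idea of hard-thresholding a LASSO estimate concrete, the following is a minimal sketch rather than the authors' procedure. It assumes a small Gaussian design, a sparse b with large nonzero entries, and a plain proximal-gradient (ISTA) solver written for illustration. The dimensions, the noise level, the penalty `lam`, the threshold `tau`, and all function names are hypothetical choices that do not come from the paper.

```python
import numpy as np

def soft_threshold(v, t):
    """Proximal operator of t * ||.||_1 (elementwise soft-thresholding)."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def lasso_ista(X, y, lam, n_iter=5000):
    """Minimise 0.5 * ||y - X s||_2^2 + lam * ||s||_1 by proximal gradient (ISTA)."""
    # Step size 1/L, where L is the squared spectral norm of X.
    step = 1.0 / np.linalg.norm(X, 2) ** 2
    s = np.zeros(X.shape[1])
    for _ in range(n_iter):
        grad = X.T @ (X @ s - y)
        s = soft_threshold(s - step * grad, step * lam)
    return s

def hard_threshold(s, tau):
    """Keep entries with |s_j| > tau and set the others to zero."""
    return np.where(np.abs(s) > tau, s, 0.0)

# Illustrative setup; every value below is an assumption, not taken from the paper.
rng = np.random.default_rng(0)
n, p, k = 50, 100, 5                       # fewer rows than columns, k nonzero coefficients
X = rng.standard_normal((n, p)) / np.sqrt(n)
b = np.zeros(p)
b[:k] = 5.0 * rng.choice([-1.0, 1.0], size=k)
y = X @ b + 0.1 * rng.standard_normal(n)   # noisy observations

b_lasso = lasso_ista(X, y, lam=0.3)
b_thr = hard_threshold(b_lasso, tau=2.0)

print("signs match after thresholding:", np.array_equal(np.sign(b_thr), np.sign(b)))
print("nonzeros before / after thresholding:",
      np.count_nonzero(b_lasso), np.count_nonzero(b_thr))
```

In this sketch the threshold is placed between the small spurious coefficients that LASSO may leave outside the support and the large true coefficients, which is the role the "large enough" requirement on b plays in the abstract. How to pick `lam` and `tau` in practice is not addressed here.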
