A generic out-of-sample error estimate is proposed for robust $M$-estimators regularized with a convex penalty in high-dimensional linear regression where $(X,y)$ is observed and $p,n$ are of the same order. If $\psi$ is the derivative of the robust data-fitting loss $\rho$, the estimate depends on the observed data only through the quantities $\hat\psi = \psi(y-X\hat\beta)$, $X^\top \hat\psi$ and the derivatives $(\partial/\partial y) \hat\psi$ and $(\partial/\partial y) X\hat\beta$ for fixed $X$. The out-of-sample error estimate enjoys a relative error of order $n^{-1/2}$ in a linear model with Gaussian covariates and independent noise, either non-asymptotically when $p/n\le \gamma$ or asymptotically in the high-dimensional asymptotic regime $p/n\to\gamma'\in(0,\infty)$. General differentiable loss functions $\rho$ are allowed provided that $\psi=\rho'$ is 1-Lipschitz. The validity of the out-of-sample error estimate holds either under a strong convexity assumption, or for the $\ell_1$-penalized Huber M-estimator if the number of corrupted observations and sparsity of the true $\beta$ are bounded from above by $s_*n$ for some small enough constant $s_*\in(0,1)$ independent of $n,p$. For the square loss and in the absence of corruption in the response, the results additionally yield $n^{-1/2}$-consistent estimates of the noise variance and of the generalization error. This generalizes, to arbitrary convex penalty, estimates that were previously known for the Lasso.
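As a concrete point of reference for the Lasso case mentioned above (square loss $\rho(u)=u^2/2$, so that $\hat\psi = y-X\hat\beta$, with $\ell_1$ penalty), the previously known degrees-of-freedom adjusted estimate takes the form sketched below. This display is a hedged illustration of that special case only, assuming isotropic Gaussian features for simplicity; the notation $\hat{df}$ is introduced here for the trace of the derivative appearing above:
$$
\hat R_{\mathrm{oos}}
= \frac{\|y - X\hat\beta\|_2^2/n}{\bigl(1-\hat{df}/n\bigr)^2},
\qquad
\hat{df} = \operatorname{tr}\Bigl[\tfrac{\partial}{\partial y}\,X\hat\beta\Bigr]
= \#\{j:\hat\beta_j\neq 0\},
$$
which estimates the out-of-sample error $\|\hat\beta-\beta\|_2^2+\sigma^2$. The general robust-loss, convex-penalty estimate of the paper is built instead from the quantities $\hat\psi$, $X^\top\hat\psi$ and the derivatives $(\partial/\partial y)\hat\psi$, $(\partial/\partial y)X\hat\beta$ listed above.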