具有共变过滤作用的强劲回归:重尾巴和对抗性污染 (Robust regression with covariate filtering: Heavy tails and adversarial contamination) - 专知论文

会员服务 ·

0

估计/估计量 · 稳健性 · 统计效率 · 情景 · SimPLe ·

2021 年 5 月 17 日

Robust regression with covariate filtering: Heavy tails and adversarial contamination

翻译：具有共变过滤作用的强劲回归:重尾巴和对抗性污染

Ankit Pensia,Varun Jog,Po-Ling Loh

from arxiv, V2: Adds new results for unknown covariance matrix (Theorem 3.13), Gaussian design (Remark 3.12), and Simulations (Section 7)

We study the problem of linear regression where both covariates and responses are potentially (i) heavy-tailed and (ii) adversarially contaminated. Several computationally efficient estimators have been proposed for the simpler setting where the covariates are sub-Gaussian and uncontaminated; however, these estimators may fail when the covariates are either heavy-tailed or contain outliers. In this work, we show how to modify the Huber regression, least trimmed squares, and least absolute deviation estimators to obtain estimators which are simultaneously computationally and statistically efficient in the stronger contamination model. Our approach is quite simple, and consists of applying a filtering algorithm to the covariates, and then applying the classical robust regression estimators to the remaining data. We show that the Huber regression estimator achieves near-optimal error rates in this setting, whereas the least trimmed squares and least absolute deviation estimators can be made to achieve near-optimal error after applying a postprocessing step.

翻译：我们研究了线性回归问题,因为两者的共变和反应都有可能(一) 重尾和(二) 对抗性污染。我们为较简单的环境提出了若干计算效率高的估算器,因为共变是亚加西语和未受污染的;然而,当共变是重尾或含有外源值时,这些估算器可能会失败。在这项工作中,我们展示了如何修改Huber回归、最小减缩方形和最小绝对偏差估计器,以获得在较强的污染模型中同时计算和统计效率的估算器。我们的方法非常简单,包括对共变法应用过滤算法,然后对剩余数据应用典型的强重回归估计器。我们显示Huber回归估计器在这个环境中达到接近最佳的错误率,而最小减缩方形和最小偏差估计器可以在应用后一步后达到近最佳的错误。

0

相关内容

估计/估计量

估计/估计量

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【Google AI论文】无妥协的弱监督解缠，Weakly-Supervised Disentanglement Without Compromises

【Google AI论文】无妥协的弱监督解缠，Weakly-Supervised Disentanglement Without Compromises

专知会员服务

20+阅读 · 2020年2月12日

【新开放书】医学影像原理与应用，Medical Imaging Principles and Applications

【新开放书】医学影像原理与应用，Medical Imaging Principles and Applications

专知会员服务

90+阅读 · 2019年12月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

【深度学习视频分析/多模态学习资源大列表】

【深度学习视频分析/多模态学习资源大列表】

专知会员服务

92+阅读 · 2019年10月16日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【NIPS2018】接收论文列表

【NIPS2018】接收论文列表

专知

5+阅读 · 2018年9月10日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Robust Variable Selection and Estimation Via Adaptive Elastic Net S-Estimators for Linear Regression

Arxiv

0+阅读 · 2021年7月7日

Finite sample breakdown point of multivariate regression depth median

Arxiv

0+阅读 · 2021年7月7日

Distributed Adaptive Huber Regression

Distributed Adaptive Huber Regression

Arxiv

0+阅读 · 2021年7月6日

Treatment Effects Estimation by Uniform Transformer

Arxiv

0+阅读 · 2021年7月6日

Neuronized Priors for Bayesian Sparse Linear Regression

Arxiv

0+阅读 · 2021年7月6日

Wasserstein Regression

Arxiv

0+阅读 · 2021年7月6日

Sufficient principal component regression for pattern discovery in transcriptomic data

Sufficient principal component regression for pattern discovery in transcriptomic data

Arxiv

0+阅读 · 2021年7月5日

Using Robust Regression to Find Font Usage Trends

Arxiv

0+阅读 · 2021年7月5日

Regression-Adjusted Estimation of Quantile Treatment Effects under Covariate-Adaptive Randomizations

Arxiv

0+阅读 · 2021年7月3日

A Robust Seemingly Unrelated Regressions For Row-Wise And Cell-Wise Contamination

Arxiv

0+阅读 · 2021年7月2日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【Google AI论文】无妥协的弱监督解缠，Weakly-Supervised Disentanglement Without Compromises

【Google AI论文】无妥协的弱监督解缠，Weakly-Supervised Disentanglement Without Compromises

专知会员服务

20+阅读 · 2020年2月12日

【新开放书】医学影像原理与应用，Medical Imaging Principles and Applications

【新开放书】医学影像原理与应用，Medical Imaging Principles and Applications

专知会员服务

90+阅读 · 2019年12月15日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

【深度学习视频分析/多模态学习资源大列表】

【深度学习视频分析/多模态学习资源大列表】

专知会员服务

92+阅读 · 2019年10月16日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

从社会学实验到行为仿真：理解基于Agent的观点动力学建模思维

中英文版《GPT-5 System Card速览》报告

ACL 2025 | 大模型结构化知识提示的泛化能力研究

【普林斯顿博士论文】大型模型的高效推理

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【NIPS2018】接收论文列表

【NIPS2018】接收论文列表

专知

5+阅读 · 2018年9月10日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Robust Variable Selection and Estimation Via Adaptive Elastic Net S-Estimators for Linear Regression

Arxiv

0+阅读 · 2021年7月7日

Finite sample breakdown point of multivariate regression depth median

Arxiv

0+阅读 · 2021年7月7日

Distributed Adaptive Huber Regression

Distributed Adaptive Huber Regression

Arxiv

0+阅读 · 2021年7月6日

Treatment Effects Estimation by Uniform Transformer

Arxiv

0+阅读 · 2021年7月6日

Neuronized Priors for Bayesian Sparse Linear Regression

Arxiv

0+阅读 · 2021年7月6日

Wasserstein Regression

Arxiv

0+阅读 · 2021年7月6日

Sufficient principal component regression for pattern discovery in transcriptomic data

Sufficient principal component regression for pattern discovery in transcriptomic data

Arxiv

0+阅读 · 2021年7月5日

Using Robust Regression to Find Font Usage Trends

Arxiv

0+阅读 · 2021年7月5日

Regression-Adjusted Estimation of Quantile Treatment Effects under Covariate-Adaptive Randomizations

Arxiv

0+阅读 · 2021年7月3日

A Robust Seemingly Unrelated Regressions For Row-Wise And Cell-Wise Contamination

Arxiv

0+阅读 · 2021年7月2日

微信扫码咨询专知VIP会员