Trimmed sample means for robust uniform mean estimation and regression - 专知 Papers

February 15, 2023

Roberto I. Oliveira, Lucas Resende

It is well known that trimmed sample means are robust against heavy tails and data contamination. This paper analyzes the performance of trimmed means and related methods in two novel contexts. The first consists of estimating expectations of functions in a given family, with uniform error bounds; this is closely related to the problem of estimating the mean of a random vector under a general norm. The second is regression with quadratic loss. In both cases, trimmed-mean-based estimators are the first to obtain optimal dependence on the (adversarial) contamination level. Moreover, they match or improve upon the state of the art in terms of heavy tails. Experiments with synthetic data show that a natural "trimmed mean linear regression" method often performs better than both ordinary least squares and alternative methods based on median-of-means.
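To make the basic idea concrete, here is a minimal sketch of a one-dimensional symmetric trimmed mean: sort the sample, discard the `k` smallest and `k` largest values, and average the rest. This is only an illustration of the classical estimator the abstract refers to, not the paper's uniform estimator or its regression method. The function name `trimmed_mean`, the trimming level `k`, and the synthetic contaminated sample are all assumptions made for this example.

```python
import numpy as np


def trimmed_mean(x, k):
    """Average of x after discarding the k smallest and k largest values."""
    x = np.sort(np.asarray(x, dtype=float))
    n = x.size
    if k < 0 or 2 * k >= n:
        raise ValueError("k must satisfy 0 <= k < n / 2")
    return x[k:n - k].mean()


# Illustrative data (an assumption, not from the paper):
# 1000 standard normal draws, with 2% of the points replaced by a huge value.
rng = np.random.default_rng(0)
clean = rng.standard_normal(1000)      # true mean is 0
contaminated = clean.copy()
contaminated[:20] = 1e6                # adversarial contamination

plain = contaminated.mean()                  # pulled far away from 0
robust = trimmed_mean(contaminated, k=30)    # trims more points than were corrupted

print(f"sample mean:  {plain:.3f}")
print(f"trimmed mean: {robust:.3f}")
```

Choosing `k` larger than the number of corrupted points ensures every planted outlier is discarded here; in practice the contamination level is unknown and `k` has to be set from an assumed upper bound on it.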
