We study the problem of boosting the accuracy of a weak learner in the (distribution-independent) PAC model with Massart noise. In the Massart noise model, the label of each example $x$ is independently misclassified with probability $\eta(x) \leq \eta$, where $\eta<1/2$. The Massart model lies between the random classification noise model and the agnostic model. Our main positive result is the first computationally efficient boosting algorithm in the presence of Massart noise that achieves misclassification error arbitrarily close to $\eta$. Prior to our work, no non-trivial booster was known in this setting. Moreover, we show that this error upper bound is best possible for polynomial-time black-box boosters, under standard cryptographic assumptions. Our upper and lower bounds characterize the complexity of boosting in the distribution-independent PAC model with Massart noise. As a simple application of our positive result, we give the first efficient Massart learner for unions of high-dimensional rectangles.
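To make the noise model concrete, here is a minimal simulation sketch (not from the paper, and the flip-rate function `eta` below is a hypothetical choice): labels of a 1-D threshold concept are flipped independently with an example-dependent probability $\eta(x)$ bounded by a constant $\eta < 1/2$.

```python
import random

ETA_MAX = 0.4  # Massart noise bound eta; must be < 1/2

def target(x):
    # clean target concept: a 1-D threshold
    return 1 if x >= 0 else -1

def eta(x):
    # hypothetical example-dependent flip rate: more noise near the
    # decision boundary, always bounded by ETA_MAX
    return ETA_MAX / (1.0 + abs(x))

def massart_sample(rng, n):
    # draw n examples and flip each label independently with prob. eta(x)
    samples = []
    for _ in range(n):
        x = rng.uniform(-1.0, 1.0)
        y = target(x)
        if rng.random() < eta(x):
            y = -y  # Massart flip
        samples.append((x, y))
    return samples

rng = random.Random(0)
data = massart_sample(rng, 10_000)
# empirical disagreement between noisy labels and the clean target;
# by construction it is at most ETA_MAX in expectation
err = sum(1 for x, y in data if target(x) != y) / len(data)
```

Note that the clean target itself incurs error roughly $\mathbf{E}[\eta(x)] \leq \eta$ on such data, which is why misclassification error close to $\eta$ is the natural benchmark the paper's booster achieves.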