用于防御统一趋同:通过对内推预测器的应用去异性化而普遍化 (In Defense of Uniform Convergence: Generalization via derandomization with an application to interpolating predictors) - 专知论文

会员服务 ·

0

泛化理论 · 泛化误差 · 预测器/决策函数 · UniFormer · 经验风险 ·

2021 年 9 月 10 日

In Defense of Uniform Convergence: Generalization via derandomization with an application to interpolating predictors

翻译：用于防御统一趋同:通过对内推预测器的应用去异性化而普遍化

Jeffrey Negrea,Gintare Karolina Dziugaite,Daniel M. Roy

from arxiv, 14 pages before references and appendices. 23 pages total. Includes a correction to Lemma 5.3 and Theorem 5.4, and their proofs

We propose to study the generalization error of a learned predictor $\hat h$ in terms of that of a surrogate (potentially randomized) predictor that is coupled to $\hat h$ and designed to trade empirical risk for control of generalization error. In the case where $\hat h$ interpolates the data, it is interesting to consider theoretical surrogate classifiers that are partially derandomized or rerandomized, e.g., fit to the training data but with modified label noise. We also show that replacing $\hat h$ by its conditional distribution with respect to an arbitrary $\sigma$-field is a convenient way to derandomize. We study two examples, inspired by the work of Nagarajan and Kolter (2019) and Bartlett et al. (2019), where the learned classifier $\hat h$ interpolates the training data with high probability, has small risk, and, yet, does not belong to a nonrandom class with a tight uniform bound on two-sided generalization error. At the same time, we bound the risk of $\hat h$ in terms of surrogates constructed by conditioning and denoising, respectively, and shown to belong to nonrandom classes with uniformly small generalization error.

翻译：我们建议研究一个学习的预测元$h$(潜在随机化)替代预测元(可能随机化)的通用差错,该预测元与美元美元相联,旨在将经验风险用于控制一般差错。在美元和美元之间对数据进行内部调试的情况下,我们建议研究部分解密或重新调整的理论替代分类元的通用差错,例如,与培训数据相适应,但使用修改的标签噪音。我们还表明,以任意的美元(gigma$-field)的有条件分配取代美元(h$)是解禁的方便方法。我们研究了两个例子,这些例子受Nagarajan和Kolter(2019年)和Bartlett等人(2019年)的工作启发,在这类例子中,学习的分类元和美元对培训数据进行部分解密或重新调整的可能性很大,风险很小,然而,也不属于在两面通用差错上严格统一的非随机类。与此同时,我们将美元的风险与不固定的等级分别表现为不固定的等级和不固定的等级。

0

相关内容

泛化理论

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

31+阅读 · 2021年9月29日

【CMU】可扩展人工智能白皮书

专知会员服务

28+阅读 · 2021年7月3日

数字化健康白皮书，17页pdf

数字化健康白皮书，17页pdf

专知会员服务

109+阅读 · 2021年1月6日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

【NeurIPS2019 论文】一致收敛可能无法解释深度学习中的泛化现象（Uniform convergence may be unable to explain generalization in deep learning）

【NeurIPS2019 论文】一致收敛可能无法解释深度学习中的泛化现象（Uniform convergence may be unable to explain generalization in deep learning）

专知会员服务

4+阅读 · 2019年12月10日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Convergence and Semi-convergence of a class of constrained block iterative methods

Arxiv

0+阅读 · 2021年10月30日

Antipodes of Label Differential Privacy: PATE and ALIBI

Arxiv

1+阅读 · 2021年10月29日

A/B/n Testing with Control in the Presence of Subpopulations

Arxiv

0+阅读 · 2021年10月29日

Optimal prediction for kernel-based semi-functional linear regression

Arxiv

0+阅读 · 2021年10月29日

Approximating the Arboricity in Sublinear Time

Arxiv

0+阅读 · 2021年10月28日

Engineering Uniform Sampling of Graphs with a Prescribed Power-law Degree Sequence

Arxiv

0+阅读 · 2021年10月28日

Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

Arxiv

0+阅读 · 2021年10月28日

A Farewell to the Bias-Variance Tradeoff? An Overview of the Theory of Overparameterized Machine Learning

Arxiv

15+阅读 · 2021年9月6日

Deep Stable Learning for Out-Of-Distribution Generalization

Arxiv

12+阅读 · 2021年4月16日

Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data

Arxiv

9+阅读 · 2021年2月8日

VIP会员

文章信息

相关主题

预测器/决策函数

相关VIP内容

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

31+阅读 · 2021年9月29日

【CMU】可扩展人工智能白皮书

专知会员服务

28+阅读 · 2021年7月3日

数字化健康白皮书，17页pdf

数字化健康白皮书，17页pdf

专知会员服务

109+阅读 · 2021年1月6日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

【NeurIPS2019 论文】一致收敛可能无法解释深度学习中的泛化现象（Uniform convergence may be unable to explain generalization in deep learning）

【NeurIPS2019 论文】一致收敛可能无法解释深度学习中的泛化现象（Uniform convergence may be unable to explain generalization in deep learning）

专知会员服务

4+阅读 · 2019年12月10日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

面向具身智能的多模态数据存储与检索：综述

《算法战争研究计划全景评估》35页

【CMU博士论文】水下三维视觉感知与生成

智能体战争：自主人工智能军备竞赛全景透视

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

人工智能 | 国际会议截稿信息9条

人工智能 | 国际会议截稿信息9条

Call4Papers

4+阅读 · 2018年3月13日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

相关论文

Convergence and Semi-convergence of a class of constrained block iterative methods

Arxiv

0+阅读 · 2021年10月30日

Antipodes of Label Differential Privacy: PATE and ALIBI

Arxiv

1+阅读 · 2021年10月29日

A/B/n Testing with Control in the Presence of Subpopulations

Arxiv

0+阅读 · 2021年10月29日

Optimal prediction for kernel-based semi-functional linear regression

Arxiv

0+阅读 · 2021年10月29日

Approximating the Arboricity in Sublinear Time

Arxiv

0+阅读 · 2021年10月28日

Engineering Uniform Sampling of Graphs with a Prescribed Power-law Degree Sequence

Arxiv

0+阅读 · 2021年10月28日

Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

Arxiv

0+阅读 · 2021年10月28日

A Farewell to the Bias-Variance Tradeoff? An Overview of the Theory of Overparameterized Machine Learning

Arxiv

15+阅读 · 2021年9月6日

Deep Stable Learning for Out-Of-Distribution Generalization

Arxiv

12+阅读 · 2021年4月16日

Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data

Arxiv

9+阅读 · 2021年2月8日

微信扫码咨询专知VIP会员