This paper proves that robustness implies generalization via data-dependent generalization bounds. As a result, robustness and generalization are shown to be closely connected in a data-dependent manner. Our bounds improve upon previous bounds in two directions, addressing an open problem that has seen little progress since 2010. The first is to reduce the dependence on the covering number. The second is to remove the dependence on the hypothesis space. We present several examples, including ones for lasso and deep learning, in which our bounds are provably preferable. Experiments on real-world data and theoretical models demonstrate near-exponential improvements in various situations. To achieve these improvements, we do not require additional assumptions on the unknown distribution; instead, we only incorporate an observable and computable property of the training samples. A key technical innovation is an improved concentration bound for multinomial random variables, which is of independent interest beyond robustness and generalization.
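As a point of reference for the covering-number dependence mentioned above, a classical robustness-based generalization bound (in the style of Xu and Mannor, 2010/2012) can be sketched as follows; this sketch is not taken from the present paper, and the notation ($K$, $\epsilon(s)$, $M$, $n$, $\delta$) is illustrative: if a learning algorithm is $(K,\epsilon(s))$-robust with respect to a partition of the sample space into $K$ cells, and the loss $\ell$ is bounded by $M$, then with probability at least $1-\delta$ over a training sample $s$ of size $n$,

\[
\left| \,\mathbb{E}_{z}\big[\ell(h_s, z)\big] \;-\; \frac{1}{n}\sum_{i=1}^{n} \ell(h_s, s_i) \,\right|
\;\le\; \epsilon(s) \;+\; M\,\sqrt{\frac{2K\ln 2 + 2\ln(1/\delta)}{n}}.
\]

The $\sqrt{K/n}$ term is the covering-number dependence that the first direction of improvement described in this abstract targets; the improved multinomial concentration bound mentioned as the key technical innovation is what sharpens the analysis of how the $n$ samples distribute over the $K$ partition cells.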