Understanding generalization in modern machine learning settings has been one of the major challenges in statistical learning theory. In this context, recent years have witnessed the development of various generalization bounds suggesting different complexity notions, such as the mutual information between the data sample and the algorithm output, the compressibility of the hypothesis space, and the fractal dimension of the hypothesis space. While these bounds have illuminated the problem at hand from different angles, the complexity notions they suggest may appear unrelated, which restricts their high-level impact. In this study, we prove novel generalization bounds through the lens of rate-distortion theory, and explicitly relate the concepts of mutual information, compressibility, and fractal dimensions in a single mathematical framework. Our approach consists of (i) defining a generalized notion of compressibility by using source coding concepts, and (ii) showing that the `compression error rate' can be linked to the generalization error both in expectation and with high probability. We show that in the `lossless compression' setting we recover and improve existing mutual information-based bounds, whereas a `lossy compression' scheme allows us to link generalization to the rate-distortion dimension -- a particular notion of fractal dimension. Our results bring a more unified perspective on generalization and open up several future research directions.
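For concreteness, the two regimes can be read against standard source-coding quantities; the display below is an illustrative sketch in generic notation ($W$ for the algorithm output, $S$ for the $n$-point training sample, $d$ for a distortion measure), not the paper's exact statements. It recalls the rate-distortion function and the classical mutual-information bound of Xu and Raginsky (2017), which the lossless setting recovers and improves:
\[
R(D) \;=\; \inf_{P_{\hat W \mid W}\,:\,\mathbb{E}[d(W,\hat W)] \le D} I(W;\hat W),
\qquad
\bigl|\mathbb{E}[\operatorname{gen}(S,W)]\bigr| \;\le\; \sqrt{\frac{2\sigma^{2}\, I(S;W)}{n}},
\]
where the second inequality assumes a $\sigma$-subgaussian loss. In the lossy regime ($D > 0$), the relevant quantity is instead the growth rate of $R(D)$ as $D \to 0$, which, up to normalization (roughly $R(D)/\log(1/D)$), gives the rate-distortion dimension of Kawabata and Dembo.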