Recently, Mutual Information (MI) has attracted attention for bounding the generalization error of Deep Neural Networks (DNNs). However, accurately estimating the MI in DNNs is intractable, so most previous works have had to relax the MI bound, which in turn weakens the information-theoretic explanation of generalization. To address this limitation, this paper introduces a probabilistic representation of DNNs for accurately estimating the MI. Leveraging the proposed MI estimator, we validate the information-theoretic explanation of generalization and derive a tighter generalization bound than the state-of-the-art relaxations.
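For context, the MI in question is I(X; T) between an input X and an internal representation T. The abstract does not detail the paper's probabilistic representation, so the following is only a minimal illustrative sketch of MI estimation using a generic histogram (binning) estimator, not the paper's method; the function name `mutual_information` and the `bins` parameter are hypothetical choices for this example.

```python
import numpy as np

def mutual_information(x, t, bins=30):
    """Histogram-based estimate of I(X; T) in nats.

    x, t: 1-D arrays of scalar samples, e.g., a 1-D projection of the
    input and of a hidden-layer activation. This is a generic binning
    estimator, not the estimator proposed in the paper.
    """
    joint, _, _ = np.histogram2d(x, t, bins=bins)
    pxy = joint / joint.sum()            # empirical joint P(X, T)
    px = pxy.sum(axis=1, keepdims=True)  # marginal P(X)
    pt = pxy.sum(axis=0, keepdims=True)  # marginal P(T)
    nonzero = pxy > 0                    # avoid log(0) on empty bins
    return float(np.sum(pxy[nonzero] * np.log(pxy[nonzero] / (px @ pt)[nonzero])))

# Toy check: T is a noisy function of X, so the estimate should be well above 0.
rng = np.random.default_rng(0)
x = rng.normal(size=10_000)
t = np.tanh(x) + 0.1 * rng.normal(size=10_000)
print(f"I(X; T) ~ {mutual_information(x, t):.3f} nats")
```

Binning estimators like this are known to be biased in high dimensions, which is precisely the intractability the abstract refers to and the motivation for the paper's estimator.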