Considering a probability distribution over parameters is known to be an effective strategy for learning neural networks with non-differentiable activation functions. We study the expectation of a probabilistic neural network as a predictor in its own right, focusing on the aggregation of binary activated neural networks with normal distributions over real-valued weights. Our work builds on a recent PAC-Bayesian analysis that yields tight generalization bounds and learning procedures for the expected output of such an aggregation, which is given by an analytical expression. While the combinatorial nature of this expression has been circumvented by approximations in previous works, we show that its exact computation remains tractable for deep but narrow neural networks, thanks to a dynamic programming approach. This leads us to a distinctive bound-minimization learning algorithm for binary activated neural networks, in which the forward pass propagates probabilities over representations rather than activation values. We also propose a stochastic counterpart that scales to wide architectures.
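To make the idea of propagating probabilities over binary representations concrete, here is a minimal sketch (not the authors' implementation) for a single hidden layer under the common assumption of isotropic Gaussian posteriors, where the expected sign activation admits the closed form E_{w ~ N(mu, I)}[sgn(w . x)] = erf(mu . x / (sqrt(2) ||x||)). For one hidden layer the expectation reduces to a direct sum over the 2^d binary representations; the dynamic programming mentioned above extends this layer by layer for deeper networks. All names (`prob_plus_one`, `expected_output`, the toy dimensions) are illustrative assumptions.

```python
# Minimal sketch: exact expected output of a one-hidden-layer binary activated
# network with Gaussian weight posteriors, by enumerating hidden binary patterns.
# Assumes isotropic N(mu, I) posteriors; names and dimensions are illustrative.
import itertools
import numpy as np
from scipy.special import erf

def prob_plus_one(mu, x):
    """P[sgn(w . x) = +1] for w ~ N(mu, I)."""
    return 0.5 * (1.0 + erf(mu @ x / (np.sqrt(2.0) * np.linalg.norm(x))))

def expected_output(mu_hidden, mu_out, x):
    """Expected sign output, averaged over the distribution of the hidden
    binary representation s in {-1, +1}^d induced by the Gaussian weights."""
    # Per-neuron probability of activating to +1 (probabilities, not activations).
    p = np.array([prob_plus_one(mu_i, x) for mu_i in mu_hidden])
    total = 0.0
    for s in itertools.product([-1.0, 1.0], repeat=len(mu_hidden)):
        s = np.array(s)
        prob_s = np.prod(np.where(s > 0, p, 1.0 - p))  # P[hidden pattern = s]
        total += prob_s * erf(mu_out @ s / (np.sqrt(2.0) * np.linalg.norm(s)))
    return total

# Toy usage: 3 inputs, 4 hidden neurons (2^4 = 16 patterns), 1 output neuron.
rng = np.random.default_rng(0)
mu_hidden = rng.normal(size=(4, 3))
mu_out = rng.normal(size=4)
x = rng.normal(size=3)
print(expected_output(mu_hidden, mu_out, x))
```

The enumeration cost grows as 2^d in the layer width d, which is why the exact computation is presented as tractable only for narrow networks and a stochastic counterpart is needed for wide architectures.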