Bayesian神经网络模型模型探索的有效性 (On the Effectiveness of Mode Exploration in Bayesian Model Averaging for Neural Networks) - 专知论文

会员服务 ·

0

模型平均 · 峰值 · Neural Networks · SimPLe · Networking ·

2021 年 12 月 7 日

On the Effectiveness of Mode Exploration in Bayesian Model Averaging for Neural Networks

翻译：Bayesian神经网络模型模型探索的有效性

John T. Holodnak,Allan B. Wollaber

from arxiv, Presented at the ICML 2021 Workshop on Uncertainty and Robustness in Deep Learning

Multiple techniques for producing calibrated predictive probabilities using deep neural networks in supervised learning settings have emerged that leverage approaches to ensemble diverse solutions discovered during cyclic training or training from multiple random starting points (deep ensembles). However, only a limited amount of work has investigated the utility of exploring the local region around each diverse solution (posterior mode). Using three well-known deep architectures on the CIFAR-10 dataset, we evaluate several simple methods for exploring local regions of the weight space with respect to Brier score, accuracy, and expected calibration error. We consider both Bayesian inference techniques (variational inference and Hamiltonian Monte Carlo applied to the softmax output layer) as well as utilizing the stochastic gradient descent trajectory near optima. While adding separate modes to the ensemble uniformly improves performance, we show that the simple mode exploration methods considered here produce little to no improvement over ensembles without mode exploration.

翻译：在有监督的学习环境中,利用深神经网络来产生校准预测概率的多种技术已经出现,这些技术利用各种办法,从多个随机起点(深孔)的周期培训或培训中发现多种混合解决办法,然而,只有有限的工作量调查了围绕每一种不同解决办法(别种模式)探索当地区域是否有用。使用CIFAR-10数据集上三个众所周知的深层结构,我们评估了在Brier评分、准确度和预期校准错误方面探索权重空间的局部区域的若干简单方法。我们认为,巴伊西亚推理技术(变异推断和汉密尔顿·蒙特卡洛都适用于软式马克思输出层)以及利用Popima附近的随机梯度梯度梯度下行轨都是一样。我们为共性一致地改进性能添加了不同的模式,同时我们表明,这里考虑的简单模式勘探方法在不进行模式勘探的情况下,不会产生任何改进。

0

相关内容

模型平均

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

【DeepMind】强化学习教程，83页ppt

【DeepMind】强化学习教程，83页ppt

专知会员服务

158+阅读 · 2020年8月7日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【干货】模型不work怎么办？大神Josh Tobin141页PPT告诉你怎么改模型

【干货】模型不work怎么办？大神Josh Tobin141页PPT告诉你怎么改模型

专知会员服务

30+阅读 · 2019年11月21日

【DLBM-SS暑期课程】深度学习与贝叶斯方法 Deep Learning and Bayesian Methods

【DLBM-SS暑期课程】深度学习与贝叶斯方法 Deep Learning and Bayesian Methods

专知会员服务

67+阅读 · 2019年11月10日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

OpenAI丨深度强化学习关键论文列表

OpenAI丨深度强化学习关键论文列表

中国人工智能学会

17+阅读 · 2018年11月10日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Transport Score Climbing: Variational Inference Using Forward KL and Adaptive Neural Transport

Transport Score Climbing: Variational Inference Using Forward KL and Adaptive Neural Transport

Arxiv

0+阅读 · 2022年2月7日

Superconvergence of Online Optimization for Model Predictive Control

Arxiv

0+阅读 · 2022年2月5日

Topological obstructions in neural networks learning

Arxiv

0+阅读 · 2022年2月5日

What Are Bayesian Neural Network Posteriors Really Like?

Arxiv

8+阅读 · 2021年4月29日

Architecture Disentanglement for Deep Neural Networks

Arxiv

6+阅读 · 2021年3月24日

Neural Architecture Generator Optimization

Arxiv

6+阅读 · 2020年10月8日

Hyperparameter Ensembles for Robustness and Uncertainty Quantification

Arxiv

12+阅读 · 2020年6月24日

Parsimonious Bayesian deep networks

Parsimonious Bayesian deep networks

Arxiv

5+阅读 · 2018年10月17日

Bayesian Convolutional Neural Networks

Arxiv

19+阅读 · 2018年6月27日

The Lovász-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks

Arxiv

4+阅读 · 2018年4月9日

VIP会员

文章信息

相关主题

Neural Networks

相关VIP内容

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

44+阅读 · 2020年12月18日

【DeepMind】强化学习教程，83页ppt

【DeepMind】强化学习教程，83页ppt

专知会员服务

158+阅读 · 2020年8月7日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【干货】模型不work怎么办？大神Josh Tobin141页PPT告诉你怎么改模型

【干货】模型不work怎么办？大神Josh Tobin141页PPT告诉你怎么改模型

专知会员服务

30+阅读 · 2019年11月21日

【DLBM-SS暑期课程】深度学习与贝叶斯方法 Deep Learning and Bayesian Methods

【DLBM-SS暑期课程】深度学习与贝叶斯方法 Deep Learning and Bayesian Methods

专知会员服务

67+阅读 · 2019年11月10日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

《面向自主对接机动的机器视觉感知技术研究》最新150页

《基于人工智能与机器学习增强信息环境下作战决策能力》2025年最新95页

美国防部机构推动人工智能红队测试演进：DARPA“战场效能鲁棒性人工智能安全”（SABER）项目

《采用兵棋推演向未来指挥官介绍作战中的信息要素》2025年最新263页

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

OpenAI丨深度强化学习关键论文列表

OpenAI丨深度强化学习关键论文列表

中国人工智能学会

17+阅读 · 2018年11月10日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Transport Score Climbing: Variational Inference Using Forward KL and Adaptive Neural Transport

Transport Score Climbing: Variational Inference Using Forward KL and Adaptive Neural Transport

Arxiv

0+阅读 · 2022年2月7日

Superconvergence of Online Optimization for Model Predictive Control

Arxiv

0+阅读 · 2022年2月5日

Topological obstructions in neural networks learning

Arxiv

0+阅读 · 2022年2月5日

What Are Bayesian Neural Network Posteriors Really Like?

Arxiv

8+阅读 · 2021年4月29日

Architecture Disentanglement for Deep Neural Networks

Arxiv

6+阅读 · 2021年3月24日

Neural Architecture Generator Optimization

Arxiv

6+阅读 · 2020年10月8日

Hyperparameter Ensembles for Robustness and Uncertainty Quantification

Arxiv

12+阅读 · 2020年6月24日

Parsimonious Bayesian deep networks

Parsimonious Bayesian deep networks

Arxiv

5+阅读 · 2018年10月17日

Bayesian Convolutional Neural Networks

Arxiv

19+阅读 · 2018年6月27日

The Lovász-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks

Arxiv

4+阅读 · 2018年4月9日

微信扫码咨询专知VIP会员