Dropout has recently emerged as a powerful and simple method for training neural networks, preventing co-adaptation by stochastically omitting neurons. However, dropout is not grounded in explicit modelling assumptions, which has so far precluded its adoption in Bayesian modelling. Using Bayesian entropic reasoning, we show that dropout can be interpreted as optimal inference under constraints. We demonstrate this on an analytically tractable regression model, providing a Bayesian interpretation of its mechanism for regularizing and preventing co-adaptation, as well as of its connection to other Bayesian techniques. We also discuss two approximate techniques for applying Bayesian dropout to general models, one based on an analytical approximation and the other on stochastic variational techniques. These techniques are then applied to a Bayesian logistic regression problem and are shown to improve performance as the model becomes more misspecified. Our framework establishes dropout as a theoretically justified and practical tool for statistical modelling, allowing Bayesians to tap into the benefits of dropout training.
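As a minimal illustration of the dropout mechanism the abstract refers to (stochastic omission of neurons), the sketch below applies standard inverted dropout with a Bernoulli keep-mask. It is a hedged toy example, not the paper's Bayesian formulation; the function name, keep probability `p`, and NumPy setup are assumptions for illustration only.

```python
import numpy as np

def dropout_forward(h, p=0.5, rng=None):
    """Apply inverted dropout to a vector of activations h.

    Each unit is kept independently with probability p; surviving
    activations are rescaled by 1/p so the layer's expected output
    is unchanged. At test time the layer is used without masking.
    """
    rng = np.random.default_rng() if rng is None else rng
    mask = rng.random(h.shape) < p  # Bernoulli(p) keep-mask
    return h * mask / p

# Example: stochastically omit units of a hidden layer during training.
h = np.array([0.2, -1.3, 0.7, 2.1])
print(dropout_forward(h, p=0.5, rng=np.random.default_rng(0)))
```

In this view each training pass samples a random sub-network, which is what discourages co-adaptation between units; the paper's contribution is to recast this sampling as optimal inference under constraints rather than as a heuristic.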