所有你需要的是一个好功能前贝叶斯深层学习 (All You Need is a Good Functional Prior for Bayesian Deep Learning)

The Bayesian treatment of neural networks dictates that a prior distribution is specified over their weight and bias parameters. This poses a challenge because modern neural networks are characterized by a large number of parameters, and the choice of these priors has an uncontrolled effect on the induced functional prior, which is the distribution of the functions obtained by sampling the parameters from their prior distribution. We argue that this is a hugely limiting aspect of Bayesian deep learning, and this work tackles this limitation in a practical and effective way. Our proposal is to reason in terms of functional priors, which are easier to elicit, and to "tune" the priors of neural network parameters in a way that they reflect such functional priors. Gaussian processes offer a rigorous framework to define prior distributions over functions, and we propose a novel and robust framework to match their prior with the functional prior of neural networks based on the minimization of their Wasserstein distance. We provide vast experimental evidence that coupling these priors with scalable Markov chain Monte Carlo sampling offers systematically large performance improvements over alternative choices of priors and state-of-the-art approximate Bayesian deep learning approaches. We consider this work a considerable step in the direction of making the long-standing challenge of carrying out a fully Bayesian treatment of neural networks, including convolutional neural networks, a concrete possibility.

翻译：贝叶斯对神经网络的处理表明,事先的分布取决于其重量和偏差参数。这构成一个挑战,因为现代神经网络的特征是有大量参数,而选择这些前科对诱发性功能前科具有不受控制的影响,即通过取样从先前分布参数获得的功能的分布。我们争辩说,这是贝叶斯深刻学习的一个极为有限的方面,这项工作以实际和有效的方式解决了这一限制。我们的提议是以功能前科为根据,因为功能前科较容易获得,而“调”前科反映这些前科的神经网络参数,从而反映这些前科的功能前科。高斯进程为界定先前功能前科提供了严格的框架,界定先前功能前科的分布提供了严格的框架,我们提出了一个新的和健全的框架,以尽量减少其瓦塞斯特斯坦距离为基础,使其先前的神经网络的运作与前科网络的功能相匹配。我们提供了大量实验证据,证明这些前科与可伸缩的马克夫链蒙卡洛抽样取样为系统提供了大规模的业绩改进改进,取代了前科和前科先科的先科选择,从而反映巴伊斯河网络长期学习方向的一个相当大的挑战。我们认为,认为,认为,这是海湾网络的深层次研究的一个方向。

相关内容

Neural Networks

关注 1648

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日