Bayesian optimization for sparse neural networks with trainable activation functions

In the literature on deep neural networks, there is considerable interest in developing activation functions that can enhance network performance. In recent years, interest has been renewed in activation functions that can be trained during the learning process, as they appear to improve network performance, especially by reducing overfitting. In this paper, we propose a trainable activation function whose parameters need to be estimated. A fully Bayesian model is developed to automatically estimate both the model weights and the activation function parameters from the training data. An MCMC-based optimization scheme is developed to perform the inference. The proposed method aims to address the aforementioned problems and to reduce convergence time by using an efficient sampling scheme that guarantees convergence to the global maximum. The proposed scheme is tested on three datasets with three different CNNs. Promising results demonstrate that the proposed activation function, combined with Bayesian estimation of the parameters, improves model accuracy.
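The abstract does not specify the activation function or the sampler, so the following is only a minimal sketch of the general idea of estimating a weight and an activation parameter jointly by MCMC. Everything in it is an assumption made for illustration: the PReLU-style `activation` with a single slope `alpha`, the one-weight model, the Gaussian likelihood and priors, the synthetic data, and the plain random-walk Metropolis sampler, which stands in for the paper's scheme and is not the authors' method.

```python
import math
import random


def activation(z, alpha):
    """Hypothetical PReLU-style stand-in: identity for z >= 0, slope alpha otherwise."""
    return z if z >= 0.0 else alpha * z


def log_posterior(w, alpha, xs, ys, noise_std=0.1, prior_std=10.0):
    """Unnormalized log posterior: Gaussian likelihood plus zero-mean Gaussian priors."""
    sq_err = sum((y - activation(w * x, alpha)) ** 2 for x, y in zip(xs, ys))
    log_lik = -0.5 * sq_err / noise_std ** 2
    log_prior = -0.5 * (w ** 2 + alpha ** 2) / prior_std ** 2
    return log_lik + log_prior


def metropolis(xs, ys, n_steps=6000, burn_in=2000, step=0.05, seed=0):
    """Random-walk Metropolis over (w, alpha); returns the post-burn-in samples."""
    rng = random.Random(seed)
    w, alpha = 1.0, 0.5  # arbitrary starting point
    current = log_posterior(w, alpha, xs, ys)
    samples = []
    for i in range(n_steps):
        # Symmetric Gaussian proposal around the current state.
        w_new = w + rng.gauss(0.0, step)
        a_new = alpha + rng.gauss(0.0, step)
        proposed = log_posterior(w_new, a_new, xs, ys)
        # Accept with probability min(1, exp(proposed - current)).
        if proposed >= current or rng.random() < math.exp(proposed - current):
            w, alpha, current = w_new, a_new, proposed
        if i >= burn_in:
            samples.append((w, alpha))
    return samples


def make_data(n=40, true_w=2.0, true_alpha=0.2, noise_std=0.1, seed=1):
    """Synthetic data generated from the same toy model (illustration only)."""
    rng = random.Random(seed)
    xs = [-1.0 + 2.0 * i / (n - 1) for i in range(n)]
    ys = [activation(true_w * x, true_alpha) + rng.gauss(0.0, noise_std) for x in xs]
    return xs, ys


xs, ys = make_data()
samples = metropolis(xs, ys)
w_mean = sum(s[0] for s in samples) / len(samples)
alpha_mean = sum(s[1] for s in samples) / len(samples)
print(f"posterior mean: w = {w_mean:.2f}, alpha = {alpha_mean:.2f}")
```

The posterior means should land near the values used to generate the data (`true_w`, `true_alpha`). Note that plain random-walk Metropolis of this kind offers no guarantee of reaching a global maximum on a multimodal posterior; the abstract attributes that property to the authors' own sampling scheme, which is not reproduced here.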

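Activation function

In an artificial neural network, the activation function of a node defines the output of that node given an input or set of inputs. A standard integrated circuit can be viewed as a digital network of activation functions, each of which is either on (1) or off (0) depending on its input. This is similar to the behavior of the linear perceptron in neural networks. However, only nonlinear activation functions allow such networks to compute nontrivial problems using a small number of nodes; such activation functions are called nonlinearities.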
