基于添加剂高斯进程回归的具有最佳神经神经元激活功能的神经网络 (Neural network with optimal neuron activation functions based on additive Gaussian process regression)

Feed-forward neural networks (NN) are a staple machine learning method widely used in many areas of science and technology. While even a single-hidden layer NN is a universal approximator, its expressive power is limited by the use of simple neuron activation functions (such as sigmoid functions) that are typically the same for all neurons. More flexible neuron activation functions would allow using fewer neurons and layers and thereby save computational cost and improve expressive power. We show that additive Gaussian process regression (GPR) can be used to construct optimal neuron activation functions that are individual to each neuron. An approach is also introduced that avoids non-linear fitting of neural network parameters. The resulting method combines the advantage of robustness of a linear regression with the higher expressive power of a NN. We demonstrate the approach by fitting the potential energy surfaces of the water molecule and formaldehyde. Without requiring any non-linear optimization, the additive GPR based approach outperforms a conventional NN in the high accuracy regime, where a conventional NN suffers more from overfitting.

翻译：进食神经网络(NN)是一种主机学习方法,在科学和技术的许多领域广泛使用。即使单隐藏层NN是一个通用的近似器,但其表达力因使用简单的神经激活功能(如类形功能)而受到限制,这些功能通常对所有神经元都是一样的。更灵活的神经激活功能将允许使用较少的神经元和层,从而节省计算成本,提高表达力。我们表明,添加式高斯进程回归(GPR)可以用来构建每个神经元都属于个人的最佳神经激活功能。还采用了一种避免神经网络参数非线性安装的方法。由此形成的方法将线性回归的强性优势与NNN的更高表达力结合起来。我们通过匹配水分子和甲型丁酸的潜在能源表面来展示这一方法。在不要求任何非线性优化的情况下,基于添加式GPR的方法在高精度系统中超越了常规NN值,因为常规的NNN更难于过度的状态。

相关内容

激活函数

关注 44

在人工神经网络中，给定一个输入或一组输入，节点的激活函数定义该节点的输出。一个标准集成电路可以看作是一个由激活函数组成的数字网络，根据输入的不同，激活函数可以是开(1)或关(0)。这类似于神经网络中的线性感知器的行为。然而，只有非线性激活函数允许这样的网络只使用少量的节点来计算重要问题，并且这样的激活函数被称为非线性。

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

59+阅读 · 2020年1月25日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日