改进神经网络通用化 (Hybridised Loss Functions for Improved Neural Network Generalisation)

Loss functions play an important role in the training of artificial neural networks (ANNs), and can affect the generalisation ability of the ANN model, among other properties. Specifically, it has been shown that the cross entropy and sum squared error loss functions result in different training dynamics, and exhibit different properties that are complementary to one another. It has previously been suggested that a hybrid of the entropy and sum squared error loss functions could combine the advantages of the two functions, while limiting their disadvantages. The effectiveness of such hybrid loss functions is investigated in this study. It is shown that hybridisation of the two loss functions improves the generalisation ability of the ANNs on all problems considered. The hybrid loss function that starts training with the sum squared error loss function and later switches to the cross entropy error loss function is shown to either perform the best on average, or to not be significantly different than the best loss function tested for all problems considered. This study shows that the minima discovered by the sum squared error loss function can be further exploited by switching to cross entropy error loss function. It can thus be concluded that hybridisation of the two loss functions could lead to better performance in ANNs.

翻译：损失功能在人工神经网络(ANNs)的培训中起着重要作用,并可能影响ANN模型的普及能力。具体地说,已经证明交叉环流和总正方差损失功能导致不同的培训动态,并显示出不同的属性,这些功能是相辅相成的。以前曾指出,对正方差和正方差损失功能的混合作用可以结合这两种功能的优势,同时限制其劣势。本研究报告调查了这种混合损失功能的有效性。事实证明,两种损失功能的混合作用提高了ANNS在所考虑的所有问题上的普遍化能力。开始以正方差错误功能进行训练的混合损失函数,以及后来开始的跨方差错损失函数的开关,要么平均表现最佳,要么与所考虑的所有问题所测试的最佳损失函数没有重大差别。这项研究表明,通过转换为交叉错误损失功能,可以进一步利用由总正方差损失函数发现的微值。因此可以断定,两种损失函数的混合作用在ANNS中会更好表现。

相关内容

损失函数（机器学习）

关注 10

损失函数，在AI中亦称呼距离函数，度量函数。此处的距离代表的是抽象性的，代表真实数据与预测数据之间的误差。损失函数（loss function）是用来估量你模型的预测值f(x)与真实值Y的不一致程度，它是一个非负实值函数,通常使用L(Y, f(x))来表示，损失函数越小，模型的鲁棒性就越好。损失函数是经验风险函数的核心部分，也是结构风险函数重要组成部分。

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日