使用 $@ell_1$-区域化和双性忠诚数据进行神经网络培训 (Neural Network Training Using $\ell_1$-Regularization and Bi-fidelity Data)

With the capability of accurately representing a functional relationship between the inputs of a physical system's model and output quantities of interest, neural networks have become popular for surrogate modeling in scientific applications. However, as these networks are over-parameterized, their training often requires a large amount of data. To prevent overfitting and improve generalization error, regularization based on, e.g., $\ell_1$- and $\ell_2$-norms of the parameters is applied. Similarly, multiple connections of the network may be pruned to increase sparsity in the network parameters. In this paper, we explore the effects of sparsity promoting $\ell_1$-regularization on training neural networks when only a small training dataset from a high-fidelity model is available. As opposed to standard $\ell_1$-regularization that is known to be inadequate, we consider two variants of $\ell_1$-regularization informed by the parameters of an identical network trained using data from lower-fidelity models of the problem at hand. These bi-fidelity strategies are generalizations of transfer learning of neural networks that uses the parameters learned from a large low-fidelity dataset to efficiently train networks for a small high-fidelity dataset. We also compare the bi-fidelity strategies with two $\ell_1$-regularization methods that only use the high-fidelity dataset. Three numerical examples for propagating uncertainty through physical systems are used to show that the proposed bi-fidelity $\ell_1$-regularization strategies produce errors that are one order of magnitude smaller than those of networks trained only using datasets from the high-fidelity models.

翻译：由于能够准确地代表物理系统模型和输出量的输入之间的功能关系,神经网络在科学应用中已成为替代模型的流行型号。然而,由于这些网络过于光度过强,因此其培训往往需要大量的数据。为了防止超常和改进一般化错误,应用了基于参数(例如$\ell_1美元和$\ell_2美元)的规范化。同样,网络的多个连接可能被调整,以增加网络参数的偏移性。在本文件中,我们探讨了在只有高忠实模型的微小培训数据集组合时,神经网络在培训神经网络中推广 $\ell_1美元-1美元-正规化的影响。相对于已知不充分的标准化标准$\ell_1美元和$\ell_2美元,我们考虑两种基于使用低忠诚模型所培训的相同网络参数的变异种 $_1美元- 更常规化的网络,这些二元级化战略是用高货币- 数字模型的简单化数据转换系统,我们用高货币化的一等化战略来学习数据系统。

相关内容

Networking

关注 22

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

一份简单《图神经网络》教程，28页ppt

专知会员服务

126+阅读 · 2020年8月2日

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

专知会员服务

74+阅读 · 2020年7月6日

【2020关键词提取】基于深度神经网络的关键词提取，Keywords extraction with deep neural network model

专知会员服务

60+阅读 · 2020年5月2日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日