Deep learning is a powerful tool for solving nonlinear differential equations, but usually, only the solution corresponding to the flattest local minimizer can be found due to the implicit regularization of stochastic gradient descent. This paper proposes a network-based structure probing deflation method to make deep learning capable of identifying multiple solutions that are ubiquitous and important in nonlinear physical models. First, we introduce deflation operators built with known solutions to make known solutions no longer local minimizers of the optimization energy landscape. Second, to facilitate the convergence to the desired local minimizer, a structure probing technique is proposed to obtain an initial guess close to the desired local minimizer. Together with neural network structures carefully designed in this paper, the new regularized optimization can converge to new solutions efficiently. Due to the mesh-free nature of deep learning, the proposed method is capable of solving high-dimensional problems on complicated domains with multiple solutions, while existing methods focus on merely one or two-dimensional regular domains and are more expensive in operation counts. Numerical experiments also demonstrate that the proposed method could find more solutions than exiting methods.
翻译:深层学习是解决非线性差异方程式的有力工具,但通常,只有由于隐含的随机梯度梯度下降的正规化,才能找到与局部最小化标准相对应的解决方案。本文件建议采用基于网络的结构,探究通缩方法,使深层学习能够找出在非线性物理模型中普遍存在且重要的多种解决方案。首先,我们引入了有已知解决方案的通缩操作者,使已知的解决方案不再成为优化能源景观的本地最小化工具。第二,为了便利与所希望的本地最小化器的趋同,建议了一种结构测试技术,以获得与所希望的本地最小化器相近的初步猜测。与本文件精心设计的神经网络结构一起,新的常规优化可以有效地与新的解决方案汇合。由于深层学习的无网状性质,拟议方法能够解决复杂领域的高维度问题,同时现有方法仅侧重于一或二维常规域,操作成本更高。数字实验还表明,拟议方法可以找到比退出方法更多的解决方案。