Deep Learning (DL), in particular deep neural networks (DNN), by design is purely data-driven and in general does not require physics. This is the strength of DL but also one of its key limitations when applied to science and engineering problems in which underlying physical properties (such as stability, conservation, and positivity) and desired accuracy need to be achieved. DL methods in their original forms are not capable of respecting the underlying mathematical models or achieving desired accuracy even in big-data regimes. On the other hand, many data-driven science and engineering problems, such as inverse problems, typically have limited experimental or observational data, and DL would overfit the data in this case. Leveraging information encoded in the underlying mathematical models, we argue, not only compensates missing information in low data regimes but also provides opportunities to equip DL methods with the underlying physics and hence obtaining higher accuracy. This short communication introduces several model-constrained DL approaches (including both feed-forward DNN and autoencoders) that are capable of learning not only information hidden in the training data but also in the underlying mathematical models to solve inverse problems. We present and provide intuitions for our formulations for general nonlinear problems. For linear inverse problems and linear networks, the first order optimality conditions show that our model-constrained DL approaches can learn information encoded in the underlying mathematical models, and thus can produce consistent or equivalent inverse solutions, while naive purely data-based counterparts cannot.
翻译:从设计上看,深度学习(DL),特别是深心神经网络(DNN),从设计上看,纯粹是数据驱动的,一般不需要物理学。这是DL的力量,但在应用到基本物理特性(如稳定、保存和活性)和预期准确性需要实现的科学和工程问题时,DL也是其中的关键限制之一。最初形式的DL方法无法尊重基本数学模型,甚至大数据系统中也无法达到预期的准确性。另一方面,许多数据驱动的科学和工程问题,例如反向问题,通常只有有限的实验或观测数据,DL将在此情况下过度适应数据。利用基本数学模型编码的信息,不仅弥补低数据系统中缺失的信息,而且提供机会使DL方法具备基础物理学,从而获得更高的准确性。这种简短的沟通引入了几种模式上受限制的DL方法(包括饲料前DNN和自动编码),这些方法不仅无法学习培训数据中隐藏的信息,而且无法学习基础数学模型中的等同数据,DL将在此情况下超常性数学模型,因此,我们目前和直线性信息的制定过程中都有问题。