This letter proposes a novel deep learning framework (DLF) that addresses two major hurdles in the adoption of deep learning techniques for solving physics-based problems: 1) requirement of the large dataset for training the DL model, 2) consistency of the DL model with the physics of the phenomenon. The framework is generic in nature and can be applied to model a phenomenon from other fields of research too as long as its behaviour is known. To demonstrate the technique, a semi-supervised physics guided neural network (SPGNN) has been developed that predicts I-V characteristics of a gallium nitride-based high electron mobility transistor (GaN HEMT). A two-stage training method is proposed, where in the first stage, the DL model is trained via the unsupervised learning method using the I-V equations of a field-effect transistor as a loss function of the model that incorporates physical behaviors in the DL model and in the second stage, the DL model has been fine-tuned with a very small set of experimental data. The SPGNN significantly reduces the requirement of the training data by more than 80% for achieving similar or better performance than a traditional neural network (TNN) even for unseen conditions. The SPGNN predicts 32.4% of the unseen test data with less than 1% of error and only 0.4% of the unseen test data with more than 10% of error.
翻译:本信提出了一个新的深层次学习框架(DLF),它解决了在采用深深深学习技术解决基于物理的问题时遇到的两大障碍:(1) 培训DL模型需要大型数据集,(2) DL模型与该现象的物理物理相一致。这个框架具有通用性质,只要其行为为人所知,就可以用来模拟其他研究领域的现象。为了展示这一技术,已经开发了一个半监督的物理导神经网络(SPGNNN),它预测了以硝化 ⁇ 为基础的高电动晶体管(GaN HEMT)的I-V特性。提出了两阶段培训方法,在第一阶段,DL模型通过不受监督的学习方法培训,使用外地效应晶体管的I-V方程式作为模型的一种损失函数,将物理行为纳入DL模型和第二阶段,DLNURNM(SGNNNNNN)模型只用非常小的一组实验数据来微调调,其培训数据的要求比NEMOV的80%还要低。