A key task in the emerging field of materials informatics is to use machine learning to predict a material's properties and functions. A fast and accurate predictive model allows researchers to more efficiently identify or construct a material with desirable properties. As in many fields, deep learning is one of the state-of-the art approaches, but fully training a deep learning model is not always feasible in materials informatics due to limitations on data availability, computational resources, and time. Accordingly, there is a critical need in the application of deep learning to materials informatics problems to develop efficient transfer learning algorithms. The Bayesian framework is natural for transfer learning because the model trained from the source data can be encoded in the prior distribution for the target task of interest. However, the Bayesian perspective on transfer learning is relatively unaccounted for in the literature, and is complicated for deep learning because the parameter space is large and the interpretations of individual parameters are unclear. Therefore, rather than subjective prior distributions for individual parameters, we propose a new Bayesian transfer learning approach based on the penalized complexity prior on the Kullback-Leibler divergence between the predictive models of the source and target tasks. We show via simulations that the proposed method outperforms other transfer learning methods across a variety of settings. The new method is then applied to a predictive materials science problem where we show improved precision for estimating the band gap of a material based on its structural properties.
翻译:新兴材料信息学领域的一个关键任务是利用机器学习来预测材料的属性和功能。快速和准确的预测模型使研究人员能够更高效地识别或构建具有理想属性的材料。在许多领域,深层次学习是最先进的方法之一,但充分培训深层次学习模式在材料信息学方面并不总是可行的,因为数据可用性、计算资源和时间有限。因此,在对材料信息学问题进行深层次学习以开发高效的传输学习算法方面,迫切需要采用深层次的学习方法来开发材料信息学问题。Bayesian框架对于转让学习是自然的,因为从源数据培训的模型可以在先前分发时为感兴趣的目标任务编码。然而,与许多领域一样,深层次学习模式的深度学习模式在文献中相对缺乏,但由于参数空间很大,对单个参数的解释也不明确。因此,我们建议采用新的巴耶斯转移学习方法,而不是主观的先前分配方法,而之前基于库尔回-利利尔差异的复杂度,因为从源的预测模型和具体目标任务之间在先前的分布中可以编码编码编码编码编码。我们随后通过模拟了一种结构学方法来显示其结构学方法的变异的方法。