Due to their disordered structure, glasses present a unique challenge in predicting the composition-property relationships. Recently, several attempts have been made to predict the glass properties using machine learning techniques. However, these techniques have the limitations, namely, (i) predictions are limited to the components that are present in the original dataset, and (ii) predictions towards the extreme values of the properties, important regions for new materials discovery, are not very reliable due to the sparse datapoints in this region. To address these challenges, here we present a low complexity neural network (LCNN) that provides improved performance in predicting the properties of oxide glasses. In addition, we combine the LCNN with physical and chemical descriptors that allow the development of universal models that can provide predictions for components beyond the training set. By training on a large dataset (~50000) of glass components, we show the LCNN outperforms state-of-the-art algorithms such as XGBoost. In addition, we interpret the LCNN models using Shapely additive explanations to gain insights into the role played by the descriptors in governing the property. Finally, we demonstrate the universality of the LCNN models by predicting the properties for glasses with new components that were not present in the original training set. Altogether, the present approach provides a promising direction towards accelerated discovery of novel glass compositions.
翻译:玻璃杯由于结构混乱,在预测构成-财产关系方面是一个独特的挑战。最近,曾几次尝试利用机器学习技术预测玻璃特性,但这些技术有其局限性,即:(一) 预测限于原始数据集中存在的部件,(二) 预测特性的极端值,新材料发现的重要区域,由于该区域数据点稀少,因此不十分可靠。为了应对这些挑战,我们提出了低复杂性神经网络(LCNN),在预测氧化玻璃特性方面提供更好的性能。此外,我们把LCNN与物理和化学描述符结合起来,以便开发通用模型,为培训组以外的部件提供预测。通过对大型数据集的培训(~5000玻璃部件),我们展示了LCNN超越像XGBoost这样的先进算法。此外,我们用沙培加加法解释LCNN模型,以了解目前玻璃杯描述器在预测当前特性中扮演的角色。最后,我们展示了目前GAR图解解剖模型在预测原属性方面没有前景的新方向。