Designing new industrial materials with desired properties can be very expensive and time consuming. The main difficulty is to generate compounds that correspond to realistic materials. Indeed, the description of compounds as vectors of components' proportions is characterized by discrete features and a severe sparsity. Furthermore, traditional generative model validation processes as visual verification, FID and Inception scores are tailored for images and cannot then be used as such in this context. To tackle these issues, we develop an original Binded-VAE model dedicated to the generation of discrete datasets with high sparsity. We validate the model with novel metrics adapted to the problem of compounds generation. We show on a real issue of rubber compound design that the proposed approach outperforms the standard generative models which opens new perspectives for material design optimization.
翻译:设计具有理想特性的新工业材料可能非常昂贵,而且耗费时间。主要困难在于产生与现实材料相对应的化合物。事实上,将化合物描述为组成部分比例的矢量的特征是离散的特征和严重的宽度。此外,传统的基因模型验证过程,如视觉验证、FID和感知分数,是针对图像的定制的,无法在此情况下使用。为了解决这些问题,我们开发了一个原始的Binded-VAE模型,专门用于生成具有高度宽度的离散数据集。我们用适应化合物生成问题的新型指标来验证该模型。我们展示了橡胶化合物设计的实际问题,即拟议方法超过了标准基因模型,为材料设计优化提供了新的视角。