Discovering new materials is a long-standing challenging task that is crucial to the progress of human society. Conventional approaches based on trial-and-error experiments and computational simulations are labor-intensive or costly with success heavily depending on experts' heuristic knowledge. Recently, deep generative models have been proposed for generative design of materials by learning implicit knowledge from known materials datasets. However, these models are either applicable to a specific material system or the performance is low due to their failure to incorporate physical rules into their model training process. Here, we propose a deep learning based Physics Guided Crystal Generative Model (PGCGM) for efficient generative materials design with high structural diversity (up to 20 different space groups). The high performance of our model manifests its capability to capture and exploit the symmetric constraints of crystals and the pairwise atomic distance constraints among neighbor atoms. Using data augmentation and spatial atom clustering and merging, our PGCGM model increases the overall generation validity performance by more than 700\% compared to FTCP, one of the state-of-the-art structure generators and by more than 45\% compared to our previous CubicGAN model. The newly generated crystal materials also show higher quality in terms of atomic spatial distribution and composition diversity. We further validated the new crystal structures by Density Functional Theory (DFT) calculations. 1,869 materials out of 2,000 were successfully optimized, of which 39.6\% have negative formation energy and 5.3\% have energy-above-hull less than 0.25 eV/atom, indicating their thermodynamic stability and potential synthesizability. The 1,869 crystal structures have been deposited to the Carolina Materials Database \url{www.carolinamatdb.org}.
翻译:发现新材料是一项长期的艰巨任务,对人类社会进步至关重要。基于试验和机变实验和计算模拟的常规方法是劳动密集型或成本高昂的,其成功与否在很大程度上取决于专家的厌食知识。最近,通过从已知材料数据集学习隐含知识,为材料的基因化设计提出了深基因模型。然而,这些模型要么适用于特定的材料系统,要么由于未能将物理规则纳入模型培训过程,性能较低。在这里,我们提议了一个基于深层次的基于数据库的物理引导晶体生成模型(PGCGM),用于结构多样性高的高效基因化材料设计(高达20个不同的空间组 ) 。我们模型的高性能显示了其捕捉和利用晶体和近邻原子间对称原子距离限制的对称性限制的能力。使用数据增强和空间原子组合和合并,我们的PGCGGM模型将整个生成的有效性提高了700%以上。 与TCP相比,一个基于状态的晶质结构发电机和超过45°T, 也显示了我们之前的Slibalal-DFA结构的能量结构。我们通过以前的CUBIG-deal-deal-de-deal-de-de-deal-de-de-de-deal sal maild-deal smal destalal exmalviolal destaldalal ex ex ex ex ex ex ex ex ex ex ex ex exmational ex ex exmmmlational exmal exal exmationalviolviolviolviolvicuildal ex ex exal ex exmum ex exal exual ex exald exal exal ex exal exal exal exal exal exal ex ex exal exal exal exal exaldaldaldaldaldal ex ex ex exal exal exal exal exmal exmal ex ex exal exal exmal exal exal exmal ex ex ex ex ex ex ex