Machine learning has been widely adopted to accelerate the screening of materials. Most existing studies implicitly assume that the training data are generated through a deterministic, unbiased process, but this assumption might not hold for the simulation of some complex materials. In this work, we aim to screen amorphous polymer electrolytes which are promising candidates for the next generation lithium-ion battery technology but extremely expensive to simulate due to their structural complexity. We demonstrate that a multi-task graph neural network can learn from a large amount of noisy, biased data and a small number of unbiased data and reduce both random and systematic errors in predicting the transport properties of polymer electrolytes. This observation allows us to achieve accurate predictions on the properties of complex materials by learning to reduce errors in the training data, instead of running repetitive, expensive simulations which is conventionally used to reduce simulation errors. With this approach, we screen a space of 6247 polymer electrolytes, orders of magnitude larger than previous computational studies. We also find a good extrapolation performance to the top polymers from a larger space of 53362 polymers and 31 experimentally-realized polymers. The strategy employed in this work may be applicable to a broad class of material discovery problems that involve the simulation of complex, amorphous materials.
翻译:为加速筛选材料,广泛采用了机器学习方法,大多数现有研究暗含地假定培训数据是通过确定性、不偏倚的过程产生的,但这一假设可能无法用于模拟某些复杂材料。在这项工作中,我们的目标是筛选可成为下一代锂离子电池技术有希望候选但因其结构复杂性而模拟费用极高的无定型聚合电解解液。我们证明多任务图神经网络可以从大量噪音、偏差的数据和少量不偏倚的数据中学习,并减少在预测聚合电解液运输特性方面的随机和系统错误。这种观察使我们能够通过学习减少培训数据中的错误,而不是进行重复的、昂贵的模拟来准确预测复杂材料的特性,而通常用来减少模拟错误。我们用这种方法来筛选6247个聚合电解液的空间,其规模大于以往的计算研究。我们还发现,从53362个聚合物和31个实验性化材料的更大空间中,对顶级聚合物的外推性表现良好,并减少了随机性和系统性错误。这种观察使我们能够通过学习减少培训数据中的错误,而不是重复的、昂贵的模拟,而通常用来减少模拟错误。我们所应用的624的聚合物材料的策略,可以用来筛选成一个复杂的研究。