Accuracy of molecular dynamics simulations depends crucially on the interatomic potential used to generate forces. The gold standard would be first-principles quantum mechanics (QM) calculations, but these become prohibitively expensive at large simulation scales. Machine learning (ML) based potentials aim for faithful emulation of QM at drastically reduced computational cost. The accuracy and robustness of an ML potential is primarily limited by the quality and diversity of the training dataset. Using the principles of active learning (AL), we present a highly automated approach to dataset construction. The strategy is to use the ML potential under development to sample new atomic configurations and, whenever a configuration is reached for which the ML uncertainty is sufficiently large, collect new QM data. Here, we seek to push the limits of automation, removing as much expert knowledge from the AL process as possible. All sampling is performed using MD simulations starting from an initially disordered configuration, and undergoing non-equilibrium dynamics as driven by time-varying applied temperatures. We demonstrate this approach by building an ML potential for aluminum (ANI-Al). After many AL iterations, ANI-Al teaches itself to predict properties like the radial distribution function in melt, liquid-solid coexistence curve, and crystal properties such as defect energies and barriers. To demonstrate transferability, we perform a 1.3M atom shock simulation, and show that ANI-Al predictions agree very well with DFT calculations on local atomic environments sampled from the nonequilibrium dynamics. Interestingly, the configurations appearing in shock appear to have been well sampled in the AL training dataset, in a way that we illustrate visually.
翻译:分子动态模拟的精度取决于用于生成力的跨原子体潜力。 金本位标准将是第一原则的量子力学计算(QM)计算,但在大型模拟规模中,这种计算成本将变得极其昂贵。基于机器的学习(ML)潜力旨在以大幅降低计算成本的方式忠实地模拟QM。ML潜力的精度和坚固性主要受培训数据集的质量和多样性的限制。使用积极学习(AL)的原则,我们展示了一种高度自动化的数据集构建方法。这个战略是利用正在开发的 ML 潜力来抽样新的原子配置,而一旦达到ML不确定性足够大的配置,就会变得极其昂贵。在这里,基于机器学习(MML)的潜力旨在忠实地以大幅降低计算成本的方式模拟QMM。所有的ML潜力都是从最初的混乱配置开始的MD模拟进行,并且正在经历非平衡的动态动力动力动力变化,我们展示了ML的潜力,从新的原子结构结构结构结构结构中, 并展示了在模型的精确性分析中, 也展示了我们所演化的轨道的稳定性, 。