The selection of coarse-grained (CG) mapping operators is a critical step for CG molecular dynamics (MD) simulation. It is still an open question about what is optimal for this choice and there is a need for theory. The current state-of-the art method is mapping operators manually selected by experts. In this work, we demonstrate an automated approach by viewing this problem as supervised learning where we seek to reproduce the mapping operators produced by experts. We present a graph neural network based CG mapping predictor called DEEP SUPERVISED GRAPH PARTITIONING MODEL(DSGPM) that treats mapping operators as a graph segmentation problem. DSGPM is trained on a novel dataset, Human-annotated Mappings (HAM), consisting of 1,206 molecules with expert annotated mapping operators. HAM can be used to facilitate further research in this area. Our model uses a novel metric learning objective to produce high-quality atomic features that are used in spectral clustering. The results show that the DSGPM outperforms state-of-the-art methods in the field of graph segmentation. Finally, we find that predicted CG mapping operators indeed result in good CG MD models when used in simulation.
翻译:在这项工作中,我们展示了一种自动化的方法,将这一问题作为我们寻求复制专家制作的绘图操作员的有监督的学习方法来看待。我们展示了一种基于CG分子动态模拟(MD)的关键步骤。它仍然是关于什么是最佳选择的未决问题,需要理论。目前最先进的方法是由专家手工选择的绘图操作员。在这项工作中,我们将这一问题作为我们寻求复制专家制作的绘图操作员的有监督的学习方法来看待。我们展示了一种以光谱集中所使用的高品质原子特征为基础的基于图形网络的CG映像预测器。结果显示DSGPM将绘图操作员作为图解析问题处理。DSGPM在新的数据集、人注解绘图(HAM)方面接受了培训,该数据集由1,206个分子组成,并配有专家附加说明的绘图操作员组成。HAM可用于促进该领域的进一步研究。我们的模型使用一种新的计量学习目标来生成光谱集中所使用的高品质原子特征。结果显示DSGPM在图形断面模型中确实使用了良好的C结果。