Molecular conformation generation aims to generate three-dimensional coordinates of all the atoms in a molecule and is an important task in bioinformatics and pharmacology. Previous methods usually first predict the interatomic distances, the gradients of interatomic distances or the local structures (e.g., torsion angles) of a molecule, and then reconstruct its 3D conformation. How to directly generate the conformation without the above intermediate values is not fully explored. In this work, we propose a method that directly predicts the coordinates of atoms: (1) the loss function is invariant to roto-translation of coordinates and permutation of symmetric atoms; (2) the newly proposed model adaptively aggregates the bond and atom information and iteratively refines the coordinates of the generated conformation. Our method achieves the best results on GEOM-QM9 and GEOM-Drugs datasets. Further analysis shows that our generated conformations have closer properties (e.g., HOMO-LUMO gap) with the groundtruth conformations. In addition, our method improves molecular docking by providing better initial conformations. All the results demonstrate the effectiveness of our method and the great potential of the direct approach. The code is released at https://github.com/DirectMolecularConfGen/DMCG
翻译:分子相容生成旨在生成分子中所有原子的三维坐标,这是生物信息学和药理学方面的一项重要任务。以前的方法通常首先预测分子的跨原子距离、跨原子距离梯度或本地结构(例如,透光角度),然后重建其3D一致性。没有充分探索如何在没有上述中间值的情况下直接生成符合。在这项工作中,我们提出一种直接预测原子坐标的方法:(1)损失功能是无法对对正数原子坐标和对称原子的变换进行恒变的;(2)新提出的模型通常适应性地综合一个分子的联结和原子信息或本地结构(例如,透光度角度),并迭接地改进所生成的对立的坐标。我们的方法在GEOM-QM9和GEOM-Drugs数据集中取得了最佳结果。进一步的分析表明,我们产生的对原子的相容性能(例如,HOMO-LUMO差距)与地光度D的兼容性。此外,我们提出的新模型通过提供更佳的初始/光谱化方法改进了我们数据库的模型。