Recent advancements in deep learning-based modeling of molecules promise to accelerate in silico drug discovery. A plethora of generative models is available, building molecules either atom-by-atom and bond-by-bond or fragment-by-fragment. However, many drug discovery projects require a fixed scaffold to be present in the generated molecule, and incorporating that constraint has only recently been explored. Here, we propose MoLeR, a graph-based model that naturally supports scaffolds as initial seed of the generative procedure, which is possible because it is not conditioned on the generation history. Our experiments show that MoLeR performs comparably to state-of-the-art methods on unconstrained molecular optimization tasks, and outperforms them on scaffold-based tasks, while being an order of magnitude faster to train and sample from than existing approaches. Furthermore, we show the influence of a number of seemingly minor design choices on the overall performance.
翻译:分子的深层学习模型的最近进步有望在硅质药物发现中加速。 有很多基因模型可供使用, 建立分子, 要么按原子逐个原子, 捆绑成团, 要么逐个碎裂。 但是, 许多药物发现项目需要固定的脚架, 才能存在于生成的分子中, 并且将这种限制纳入到最近才被探索过。 我们在这里提议以图为基础的模型MoLeR, 这个模型自然支持脚架作为基因化过程的初始种子, 这是可能的, 因为它不以一代历史为条件 。 我们的实验显示, MoLeR 在不受限制的分子优化任务上表现得与最先进的方法相当, 并且比现有方法更快的培训和采样数量级。 此外, 我们显示了一些看似次要的设计选择对总体性的影响 。