Inverse molecular design, i.e., designing molecules with specific target properties, can be posed as an optimization problem. High-dimensional optimization tasks in the natural sciences are commonly tackled via population-based metaheuristic optimization algorithms such as evolutionary algorithms. However, expensive property evaluation, which is often required, can limit the widespread use of such approaches as the associated cost can become prohibitive. Herein, we present JANUS, a genetic algorithm that is inspired by parallel tempering. It propagates two populations, one for exploration and another for exploitation, improving optimization by reducing expensive property evaluations. Additionally, JANUS is augmented by a deep neural network that approximates molecular properties via active learning for enhanced sampling of the chemical space. Our method uses the SELFIES molecular representation and the STONED algorithm for the efficient generation of structures, and outperforms other generative models in common inverse molecular design tasks achieving state-of-the-art performance.
翻译:反分子设计,即设计具有特定目标特性的分子,可以作为一个优化问题提出。自然科学中的高维优化任务通常通过基于人口的计量优化算法(如进化算法)来解决。然而,通常需要的昂贵的财产评估可以限制广泛使用这类方法,因为相关的成本会变得令人望而却步。在这里,我们介绍JANUS,一种由平行的调情所启发的遗传算法。它传播两种人口,一种用于勘探,另一种用于开发,通过减少昂贵的财产评估改进优化。此外,JANUS还利用一个深层神经网络,通过积极学习加强化学空间的采样来近似分子特性。我们的方法使用SELFIES分子代表法和STONED算法来高效地生成结构,在共同的反分子设计任务中超越其他基因模型,实现最先进的性能。