Despite the impressive successes of deep learning approaches for various chemical problems such as property prediction, virtual screening, and de novo molecule design, separately designed models for specific tasks are usually required, and it is often difficult to synergistically combine these models for novel tasks. To address this, here we present a bidirectional molecular foundation model that can be used for both molecular structure and property inferences through a single model, inspired by recent multimodal learning methods such as VLP. Furthermore, thanks to the outstanding structure/property alignment in a common embedding space, experimental results confirm that our method leads to state-of-the-art performance and interpretable attention maps in both multimodal and unimodal tasks, including conditional molecule generation, property prediction, molecule classification, and reaction prediction.
翻译:尽管在财产预测、虚拟筛选和新分子设计等各种化学问题的深层学习方法方面取得了令人印象深刻的成功,但通常需要为具体任务单独设计模型,而且往往难以为新任务协同使用这些模型。为了解决这个问题,我们在这里提出了一个双向分子基础模型,通过一个单一模型,在诸如VLP等最近多式学习方法的启发下,既可用于分子结构和财产推断。 此外,由于在共同嵌入空间中的结构/财产调整突出,实验结果证实,我们的方法导致在多式和单式任务中,包括有条件的分子生成、财产预测、分子分类和反应预测,产生最先进的性能和可解释的注意地图。