The identification of protein-ligand interaction plays a key role in biochemical research and drug discovery. Although deep learning has recently shown great promise in discovering new drugs, there remains a gap between deep learning-based and experimental approaches. Here we propose a novel framework, named AIMEE, integrating AI Model and Enzymology Experiments, to identify inhibitors against 3CL protease of SARS-CoV-2, which has taken a significant toll on people across the globe. From a bioactive chemical library, we have conducted two rounds of experiments and identified six novel inhibitors with a hit rate of 29.41%, and four of them showed an IC50 value less than 3 {\mu}M. Moreover, we explored the interpretability of the central model in AIMEE, mapping the deep learning extracted features to domain knowledge of chemical properties. Based on this knowledge, a commercially available compound was selected and proven to be an activity-based probe of 3CLpro. This work highlights the great potential of combining deep learning models and biochemical experiments for intelligent iteration and expanding the boundaries of drug discovery.
翻译:在生物化学研究和药物发现方面,确定蛋白质和相互作用具有关键作用。虽然最近深层次的学习在发现新药物方面显示出很大的希望,但深层次的学习基础和实验方法之间仍然存在差距。在这里,我们提议了一个名为AIME的新框架,将AI模型和酶学实验结合起来,以识别针对SARS-COV-23CL的3CL蛋白的抑制剂,SARS-COV-2已经给全球人民造成了重大损失。我们从一个生物活性化学图书馆中进行了两轮实验,并确定了六种新抑制剂,其冲击率为29.41%,其中四种显示IC50值低于3 mu}M。此外,我们探讨了AIMEE中心模型的可解释性,绘制了深度学习的特征,以覆盖化学特性的知识领域。根据这一知识,选定并证明一个商业上可用的化合物是3CLPRO的活动探测。这项工作突出了将深层学习模型和生物化学实验结合起来,促进智能重复和扩大药物发现界限的巨大潜力。