Antimicrobial peptide (AMP) is a promising therapy in the treatment of broad-spectrum antibiotics and drug-resistant infections. Recently, an increasing number of researchers have been introducing deep generative models to accelerate AMP discovery. However, current studies mainly focus on sequence attributes and ignore structure information, which is important in AMP biological functions. In this paper, we propose a latent sequence-structure model for AMPs (LSSAMP) with multi-scale VQ-VAE to incorporate secondary structures. By sampling in the latent space, LSSAMP can simultaneously generate peptides with ideal sequence attributes and secondary structures. Experimental results show that the peptides generated by LSSAMP have a high probability of AMP, and two of the 21 candidates have been verified to have good antimicrobial activity. Our model will be released to help create high-quality AMP candidates for follow-up biological experiments and accelerate the whole AMP discovery.
翻译:抗微生物浸泡剂(AMP)是治疗广泛频谱抗生素和抗药性感染的一种很有希望的疗法。最近,越来越多的研究人员开始采用深基因模型来加速AMP的发现。然而,目前的研究主要侧重于序列属性和忽略结构信息,这对AMP生物功能很重要。我们在本文件中为AMP(LSSAMP)提出了一种潜在的序列结构模型,该模型具有多级VQ-VAE,可以纳入二级结构。通过在潜藏空间取样,LSSAMP可以同时产生带有理想序列属性和二级结构的peptides。实验结果表明,LSSAMP产生的pids具有很高的AMP概率,21名候选人中有2人已被核实具有良好的抗微生物活动。我们的模型将发布,以帮助培养高质量的AMP候选人进行后续生物学实验并加速整个AMP的发现。