A central problem in computational biophysics is protein structure prediction, i.e., finding the optimal folding of a given amino acid sequence. This problem has been studied in a classical abstract model, the HP model, in which the protein is modeled as a sequence of H (hydrophobic) and P (polar) amino acids on a lattice, and the objective is to find conformations maximizing the number of H-H contacts. Even in this reduced setting, the problem is known to be NP-hard. In this work, we apply deep reinforcement learning (DRL) to the two-dimensional HP model and obtain conformations with the best-known energies for benchmark HP sequences of lengths 20 to 50. Our approach is based on a deep Q-network (DQN); we find that a DQN built on a long short-term memory (LSTM) architecture greatly enhances learning and substantially improves the search. DRL samples the state space efficiently, without the need for hand-crafted heuristics, and we show experimentally that it can find multiple distinct best-known solutions per trial. This study demonstrates the effectiveness of deep reinforcement learning for protein folding in the HP model.
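For concreteness, here is a minimal Python sketch of the HP-model objective: a fold is a self-avoiding walk on the square lattice, and the energy is minus the number of H-H contacts between residues that are lattice-adjacent but not consecutive in the chain. The function name and representation are illustrative, not the paper's code.

```python
import itertools

def hp_energy(sequence, coords):
    """Energy of a 2D HP fold: -1 per H-H contact between residues
    that are adjacent on the lattice but not adjacent in the chain."""
    contacts = 0
    for i, j in itertools.combinations(range(len(sequence)), 2):
        if sequence[i] == 'H' and sequence[j] == 'H' and j - i > 1:
            (xi, yi), (xj, yj) = coords[i], coords[j]
            if abs(xi - xj) + abs(yi - yj) == 1:  # unit Manhattan distance
                contacts += 1
    return -contacts

# Toy example: a 4-residue chain folded into a square.
seq = "HPPH"
fold = [(0, 0), (1, 0), (1, 1), (0, 1)]  # self-avoiding walk on Z^2
print(hp_energy(seq, fold))  # -1: the two H residues form one contact
```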
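To illustrate the kind of sequential model involved, the following PyTorch sketch shows one plausible shape for an LSTM-based Q-network in this setting: the partial fold so far is read as a sequence, and the final hidden state is mapped to Q-values over relative lattice moves. The input encoding, action set (left/forward/right), and layer sizes are assumptions for illustration, not the architecture reported in the paper.

```python
import torch
import torch.nn as nn

class LSTMDQN(nn.Module):
    """Hypothetical LSTM-based Q-network: an LSTM summarizes the
    partial conformation and a linear head produces Q-values."""
    def __init__(self, input_dim=4, hidden_dim=128, n_actions=3):
        super().__init__()
        self.lstm = nn.LSTM(input_dim, hidden_dim, batch_first=True)
        self.head = nn.Linear(hidden_dim, n_actions)

    def forward(self, x):
        # x: (batch, steps, input_dim) -- the moves placed so far
        out, _ = self.lstm(x)
        return self.head(out[:, -1, :])  # one Q-value per action

net = LSTMDQN()
state = torch.zeros(1, 5, 4)   # dummy 5-step partial fold
q_values = net(state)          # shape (1, 3)
action = q_values.argmax(dim=1)  # greedy move: left, forward, or right
```

Recurrence is a natural fit here because the state is a variable-length prefix of the fold; the LSTM's hidden state can carry the chain's history without a fixed-size hand-designed feature vector.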