项目名称: 基于结构与序列信息的蛋白质-配体结合位点的预测
项目编号: No.11501306
项目类型: 青年科学基金项目
立项/批准年度: 2016
项目学科: 数理科学和化学
项目作者: 杨建益
作者单位: 南开大学
项目金额: 18万元
中文摘要: 蛋白质是细胞中重要的生物大分子,它们通过与配体的相互作用来执行其生物学功能。因此对蛋白质-配体结合位点的预测能为研究蛋白质功能提供重要依据。蛋白质-配体结合位点的预测方法主要包括基于模板的预测方法和从头预测方法。这两类方法都存在弊端。其中,基于模板的预测方法在未识别出同源模板的情况下基本失效;而从头预测方法只能适用于少数几类特殊的配体。本项目通过利用结构与序列信息,开发新的蛋白质-配体结合位点的预测方法,以解决现有方法的弊端。这主要包括以下三个方面的研究:1)对现有配体进行统计与聚类分析,并利用序列进化信息,对出现频率较高的配体开发相应的从头预测算法;2)开发局部结构比对算法,通过与全局结构比对的结合来提高模板识别的覆盖率,以降低基于模板的预测方法失效的概率;3)通过对大量预测结果的系统分析,探究综合多种方法的最优方案。据此建立一个用于蛋白质-配体结合位点预测的综合算法。
中文关键词: 蛋白质结构;蛋白质-配体相互作用;配体结合位点;蛋白质结构比对;蛋白质功能预测
英文摘要: Proteins are important macromolecules in cells. They perform their biological function through interaction with ligands. Therefore, the prediction of protein-ligand binding sites can offer valuable clues for protein functional studies. In general, the methods for the prediction of protein-ligand binding sites can be divided into two groups: template-based predictors and ab-initio predictors. These methods have the following disadvantages. Template-based predictors do not work well when homologous templates are not recognized; while ab-initio predictors only work for several special kinds of ligands. In this project, we will use the structure and sequence information to develop new methods for protein-ligand binding sites prediction, aiming to solve the problems of current methods. The study includes three major aspects. 1) Perform statistical and clustering analysis on existing ligands. Using the sequence evolutionary information, develop a new ab-initio predictor for the frequently occurring ligands, solving the limitation of current ab-initio predictors. 2) Develop a new algorithm for local structure alignment and combine it with global structure alignment to improve the coverage of template recognition, reducing the failure rate of template-based predictors. 3) Investigate the best way to combine multiple predictors by systematic analysis of existing prediction results. Based on the result of this investigation, develop a meta-algorithm for the prediction of protein-ligand binding sites.
英文关键词: Protein structure;Protein-ligand interaction;Ligand-binding sites;Protein structure alignment;Protein function prediction