项目名称: 基于不可约序关系的复杂疾病诊断基因模式发现技术研究
项目编号: No.61272182
项目类型: 面上项目
立项/批准年度: 2013
项目学科: 自动化技术、计算机技术
项目作者: 赵宇海
作者单位: 东北大学
项目金额: 81万元
中文摘要: 诊断基因是指与特定疾病表型密切相关的基因,通常具有很强的疾病标识能力。诊断基因的发现对探究疾病成因及疾病的诊治、预防至关重要,已成为当前生物信息学领域的研究热点和重点之一。现有方法忽略了基因间的相互关系信息,探究基因交互与疾病关联的能力较弱。本研究提出利用基因表达值间的序关系发现诊断基因模式的新思路,并引入不可约序关系体现基因的交互,用更少的基因获得更准确的疾病诊断。本研究侧重于设计新的诊断基因模式及高效、准确的发现算法。主要内容包括:微阵列数据的序列化模型构建、基于不可约序关系的有监督诊断基因模式发现、基于不可约序关系的无监督诊断基因模式发现、"噪音"环境中基于不可约序关系的诊断基因模式发现、基于协同不可约序关系的诊断基因模式精简等。研究成果对提高诊断准确率、缩小候选基因范围、降低诊断费用,都具有非常重要的实际意义,并且能为考察疾病成因提供新视角。具有较高的理论研究价值和广阔的应用前景。
中文关键词: 数据挖掘;不可约序关系;特征提取;诊断基因;基因表达数据
英文摘要: Diagnostic genes refer to the genes closely related to a specific disease phenotype, the power of which to distinguish between different classes are often high. Since discovering diagnostic genes is highly important for exploring the pathogeny, diagnosis, treatment and prevention of disease, it has been one of the hottest and the most important topics in bioinformatics. Most of the existing methods perform diagonistic gene discovery under the assumption of the independence among genes or only consider the combinational discriminative power of genes. That is, the abundant information of the interrelationship among the selected genes is ignored. This study proposes the novel idea that incorporating the ordering information of gene expression values into diagnostic gene discovery and introduces the novel conception of non-redundant discrimative sequence. As a result, it is enabled to achieve higher accuracy of disease diagnosis using fewer genes. This study focuses on the design of the novel and biologically explicable diagnostic gene pattern and the development of the effective and efficient discovery algorithms. The main contents of this research include: construction of microarray data sequence model, unsupervised discovery of diagnostic gene pattern based on non-reducible sequence, supervised discovery of diagn
英文关键词: data mining;non-reducible sequence;feature selection;diagnostic gene;gene expression data