Ontology-based approaches for predicting gene-disease associations include the more classical semantic similarity methods and more recently knowledge graph embeddings. While semantic similarity is typically restricted to hierarchical relations within the ontology, knowledge graph embeddings consider their full breadth. However, embeddings are produced over a single graph and complex tasks such as gene-disease association may require additional ontologies. We investigate the impact of employing richer semantic representations that are based on more than one ontology, able to represent both genes and diseases and consider multiple kinds of relations within the ontologies. Our experiments demonstrate the value of employing knowledge graph embeddings based on random-walks and highlight the need for a closer integration of different ontologies.
翻译:以本体学为基础的预测基因疾病协会的方法包括比较传统的语义相似的方法和最近的知识图嵌入。虽然语义相似通常限于本体学内的等级关系,但知识图嵌入考虑它们的全部广度。但是,嵌入是通过单一的图表产生的,基因疾病协会等复杂任务可能需要额外的本体学。我们调查了使用基于不止一种本体学、能够代表基因和疾病并考虑本体学内多种关系的较丰富的语义表层结构的影响。我们的实验表明使用基于随机行走的知识图嵌入的价值,并强调需要更密切地整合不同本体学。