Medical diagnosis is the process of making a prediction of the disease a patient is likely to have, given a set of symptoms and observations. This requires extensive expert knowledge, in particular when covering a large variety of diseases. Such knowledge can be coded in a knowledge graph -- encompassing diseases, symptoms, and diagnosis paths. Since both the knowledge itself and its encoding can be incomplete, refining the knowledge graph with additional information helps physicians making better predictions. At the same time, for deployment in a hospital, the diagnosis must be explainable and transparent. In this paper, we present an approach using diagnosis paths in a medical knowledge graph. We show that those graphs can be refined using latent representations with RDF2vec, while the final diagnosis is still made in an explainable way. Using both an intrinsic as well as an expert-based evaluation, we show that the embedding-based prediction approach is beneficial for refining the graph with additional valid conditions.
翻译:医学诊断是预测病人可能患的疾病的过程,如果有一系列症状和观察,这需要广泛的专家知识,特别是在涉及多种疾病时。这种知识可以编成知识图表 -- -- 包括疾病、症状和诊断路径。由于知识本身及其编码都不完整,用更多信息来完善知识图表有助于医生作出更好的预测。与此同时,为了在医院进行诊断,必须解释和透明。在本文中,我们用医学知识图表中的诊断路径来介绍一种方法。我们用RDF2vec的隐形图解析表明,这些图表是可以改进的,而最后的诊断仍然以可以解释的方式进行。我们用内在的和专家的评估来表明,基于嵌入的预测方法有利于用其他有效条件对图表进行精炼。