With the rapid development of biomedical software and hardware, a large amount of relational data interlinking genes, proteins, chemical components, drugs, diseases, and symptoms has been collected for modern biomedical research. Many graph-based learning methods have been proposed to analyze such type of data, giving a deeper insight into the topology and knowledge behind the biomedical data, which greatly benefit to both academic research and industrial application for human healthcare. However, the main difficulty is how to handle high dimensionality and sparsity of the biomedical graphs. Recently, graph embedding methods provide an effective and efficient way to address the above issues. It converts graph-based data into a low dimensional vector space where the graph structural properties and knowledge information are well preserved. In this survey, we conduct a literature review of recent developments and trends in applying graph embedding methods for biomedical data. We also introduce important applications and tasks in the biomedical domain as well as associated public biomedical datasets.
翻译:随着生物医学软件和硬件的迅速发展,为现代生物医学研究收集了大量关联数据,将基因、蛋白质、化学成分、药物、疾病和症状联系起来,为现代生物医学研究收集了大量相关数据,提出了许多基于图表的学习方法,以分析这类数据,更深入地了解生物医学数据背后的地形学和知识,这对学术研究和人类保健工业应用都大有裨益。然而,主要困难在于如何处理生物医学图的高维度和广度。最近,图表嵌入方法为处理上述问题提供了切实有效的方法。它把基于图表的数据转换成一个低维矢量的低维量空间,其中图表结构属性和知识信息得到妥善保存。在这次调查中,我们对生物医学数据应用图形嵌入方法的最新动态和趋势进行了文献审查。我们还介绍了生物医学领域的重要应用和任务,以及相关的公共生物医学数据集。