Heterogeneous networks are large graphs consisting of different types of nodes and edges. They are an important category of complex networks, but the process of knowledge extraction and relations discovery from these networks are so complicated and time-consuming. Moreover, the scale of these networks is steadily increasing. Thus, scalable and accurate methods are required for efficient knowledge extraction. In this paper, two distributed label propagation algorithms, namely DHLP-1 and DHLP-2, in the heterogeneous networks have been introduced. The Apache Giraph platform is employed which provides a vertex-centric programming model for designing and running distributed graph algorithms. Complex heterogeneous networks have many examples in the real world and are widely used today for modeling complicated processes. Biological networks are one of such networks. As a case study, we have measured the efficiency of our proposed DHLP-1 and DHLP-2 algorithms on a biological network consisting of drugs, diseases, and targets. The subject we have studied in this network is drug repositioning, aimed at saving both time and cost by suggesting new indications for the current drugs. We compared the proposed algorithms with similar non-distributed versions of them namely MINProp and Heter-LP. The experiments revealed that the runtime of the algorithms has decreased in the distributed versions rather than non-distributed ones dramatically. The effectiveness of our proposed algorithms against other algorithms is supported through statistical analysis of 10-fold cross-validation as well as experimental analysis.
翻译:由不同类型节点和边缘组成的大图层网络,是不同类型节点和边缘组成的大图,是复杂网络的一个重要类别,但从这些网络中提取知识和发现关系的过程如此复杂和耗时。此外,这些网络的规模正在稳步扩大。因此,为了高效率地提取知识,需要采用可缩放和准确的方法。在本文中,引入了两个分布式标签传播算法,即多种网络中的DHLP-1和DHLP-2。Apache Giraph 平台是用来为设计和运行分布式图表算法提供跨脊椎中心编程模型的。复杂的混合网络在现实世界中有许多实例,而且今天广泛用于模拟复杂过程。生物网络是这类网络中的一种。我们测量了我们提议的DHLP-1和DHLP-2算法的效率,在由毒品、疾病和目标组成的生物网络中,我们所研究的主题是药物支持的重新定位,目的是节省时间和成本,为当前药物提供新的标识。我们比较了拟议的算法,在实际世界里有许多例子,而拟议的算法与类似的非分配式的统计性算法分析是他所显示的驱动式,而不是驱动式分析,通过驱动式分析,通过驱动式的推算法,而不是驱动式的推算法的推式的推式的推式的推算法,通过不断变式的推式的推式的推式的推式的推式的推式的推式的推式的推式的推式的推式分析,它。