We study the problem of large-scale network embedding, which aims to learn low-dimensional latent representations for network mining applications. Recent research in the field of network embedding has led to significant progress such as DeepWalk, LINE, NetMF, NetSMF. However, the huge size of many real-world networks makes it computationally expensive to learn network embedding from the entire network. In this work, we present a novel network embedding method called "NES", which learns network embedding from a small representative subgraph. NES leverages theories from graph sampling to efficiently construct representative subgraph with smaller size which can be used to make inferences about the full network, enabling significantly improved efficiency in embedding learning. Then, NES computes the network embedding from this representative subgraph, efficiently. Compared with well-known methods, extensive experiments on networks of various scales and types demonstrate that NES achieves comparable performance and significant efficiency superiority.
翻译:我们研究大规模网络嵌入的问题,目的是学习网络采矿应用的低维潜在代表面,最近对网络嵌入领域的研究已经取得了重大的进展,例如深电站、LINE、NetMF、NetSMF。然而,许多真实世界网络的庞大规模使得从整个网络中学习网络嵌入的计算成本很高。在这项工作中,我们提出了一个名为“NES”的新颖的网络嵌入方法,从一个有代表性的小型子集中学习网络嵌入。NES利用从图样取样到高效构建规模较小的代表性子集的理论,可以用来推断整个网络,从而大大提高嵌入学习的效率。然后,NES对网络嵌入这个具有代表性的子集图集进行了高效的计算。与众所周知的方法相比,在各种规模和类型的网络上进行的广泛实验表明,国家空间研究中心取得了相似的性能和显著的效率优势。