Betweenness centrality (BC) is one of the most used centrality measures for network analysis, which seeks to describe the importance of nodes in a network in terms of the fraction of shortest paths that pass through them. It is key to many valuable applications, including community detection and network dismantling. Computing BC scores on large networks is computationally challenging due to high time complexity. Many approximation algorithms have been proposed to speed up the estimation of BC, which are mainly sampling-based. However, these methods are still prone to considerable execution time on large-scale networks, and their results are often exacerbated when small changes happen to the network structures. In this paper, we focus on identifying nodes with high BC in a graph, since many application scenarios are built upon retrieving nodes with top-k BC. Different from previous heuristic methods, we turn this task into a learning problem and design an encoder-decoder based framework to resolve the problem. More specifcally, the encoder leverages the network structure to encode each node into an embedding vector, which captures the important structural information of the node. The decoder transforms the embedding vector for each node into a scalar, which captures the relative rank of this node in terms of BC. We use the pairwise ranking loss to train the model to identify the orders of nodes regarding their BC. By training on small-scale networks, the learned model is capable of assigning relative BC scores to nodes for any unseen networks, and thus identifying the highly-ranked nodes. Comprehensive experiments on both synthetic and real-world networks demonstrate that, compared to representative baselines, our model drastically speeds up the prediction without noticeable sacrifce in accuracy, and outperforms the state-of-the-art by accuracy on several large real-world networks.
翻译:中间中心值( BC) 是网络分析中最常用的中心度衡量标准之一, 它试图描述节点在网络中的重要性, 即通过网络结构发生小变化时, 其速度偏差的重要性 。 在本文中, 我们侧重于在图表中找到不列颠哥伦比亚高点的节点, 因为许多应用方案是在使用顶部不列颠不列颠的节点后建立的 。 在大型网络中计算不列颠不列颠的分分数, 与以往的超时复杂性方法不同, 计算在计算过程中具有挑战性 。 许多近似算算法建议加快对不列颠纽的估算, 主要是基于抽样的计算。 然而, 这些方法仍然容易在大型网络中执行相当长的时间, 并且当网络出现小的节点时, 其结果往往会恶化 。 我们的重点是在图表中找到不列的不列颠峰值的点, 将相对的分数网络转换成直线值网络 。 因此, 我们的任务会变成一个学习的直径直径直值网络 。