Betweenness centrality is a popular centrality measure with applications in several domains, and whose exact computation is impractical for modern-sized networks. We present SILVAN, a novel, efficient algorithm to compute, with high probability, accurate estimates of the betweenness centrality of all nodes of a graph and a high-quality approximation of the top-k betweenness centralities. SILVAN follows a progressive sampling approach, and builds on novel bounds based on Monte-Carlo Empirical Rademacher Averages, a powerful and flexible tool from statistical learning theory. SILVAN relies on a novel estimation scheme providing non-uniform bounds on the deviation of the estimates of the betweenness centrality of all the nodes from their true values, and a refined characterisation of the number of samples required to obtain a high-quality approximation. Our extensive experimental evaluation shows that SILVAN extracts high-quality approximations while outperforming, in terms of number of samples and accuracy, the state-of-the-art approximation algorithm with comparable quality guarantees.
翻译:中间点是几个领域应用的流行中心尺度,其精确计算对现代规模的网络来说不切实际。我们提供了SILVAN,这是一个创新的、高效的算法,用来以高概率准确估计图中所有节点的中间点之间的中间点,以及高品质的中间点之间的近似值。SILVAN采用渐进的抽样方法,并以基于蒙特-卡洛的Emspiritical Rademacher Access(Monte-Carlo Emporical Rademacher Universitys)的新颖的界限为基础,根据统计学习理论的强大而灵活的工具。 SILVAN依靠一种新颖的估算方法,提供非统一的界限,说明所有节点与真实值之间的中心点之间的估计偏差,以及精确的样本数量,以便获得高质量的近点。我们广泛的实验评估表明,SILVAN在样品数量和准确性方面优于具有可比质量保证的先进近点算法。