Network analysis has played a key role in knowledge discovery and data mining. In many real-world applications in recent years, we are interested in mining multilayer networks, where we have a number of edge sets called layers, which encode different types of connections and/or time-dependent connections over the same set of vertices. Among many network analysis techniques, dense subgraph discovery, aiming to find a dense component in a network, is an essential primitive with a variety of applications in diverse domains. In this paper, we introduce a novel optimization model for dense subgraph discovery in multilayer networks. Our model aims to find a stochastic solution, i.e., a probability distribution over the family of vertex subsets, rather than a single vertex subset, whereas it can also be used for obtaining a single vertex subset. For our model, we design an LP-based polynomial-time exact algorithm. Moreover, to handle large-scale networks, we also devise a simple, scalable preprocessing algorithm, which often reduces the size of the input networks significantly and results in a substantial speed-up. Computational experiments demonstrate the validity of our model and the effectiveness of our algorithms.
翻译:在知识发现和数据挖掘方面,网络分析在知识发现和数据挖掘方面发挥了关键作用。在最近几年的许多现实世界应用中,我们感兴趣的是采矿多层网络,其中我们有许多称为层的边缘数据集,它们编码了不同类型的连接和(或)在同一组脊椎上的时间依赖连接。在许多网络分析技术中,密集的子图发现,旨在在网络中找到一个密集的组件,是一个基本原始,在不同领域有各种应用。在本文中,我们为多层网络中的密集子图发现引入了一个新型优化模型。我们的模型旨在找到一种随机的解决方案,即,在顶层子组的家族中进行概率分布,而不是单一的顶层子组,而也可以用来获取单一的顶端子组。对于我们的模型,我们设计了一个基于LP的多元时精确算法。此外,为了处理大型网络,我们还设计了一个简单、可缩放的预处理算法,这往往大大降低输入网络的规模,并在快速上的结果。