The Generalized Independent Set (GIS) problem extends the classical maximum independent set problem by incorporating profits for vertices and penalties for edges. This generalized problem has been identified in diverse applications in fields such as forest harvest planning, competitive facility location, social network analysis, and even machine learning. However, solving the GIS problem in large-scale, real-world networks remains computationally challenging. In this paper, we explore data reduction techniques to address this challenge. We first propose 14 reduction rules that can reduce the input graph with rigorous optimality guarantees. We then present a reduction-driven local search (RLS) algorithm that integrates these reduction rules into the pre-processing, the initial solution generation, and the local search components in a computationally efficient way. The RLS is empirically evaluated on 278 graphs arising from different application scenarios. The results indicates that the RLS is highly competitive -- For most graphs, it achieves significantly superior solutions compared to other known solvers, and it effectively provides solutions for graphs exceeding 260 million edges, a task at which every other known method fails. Analysis also reveals that the data reduction plays a key role in achieving such a competitive performance.
翻译:广义独立集(GIS)问题通过引入顶点收益与边惩罚,扩展了经典的最大独立集问题。这一广义问题已在森林采伐规划、竞争性设施选址、社交网络分析乃至机器学习等多个领域的实际应用中被识别。然而,在大规模现实网络中对GIS问题进行求解,仍面临计算上的挑战。本文探讨了数据归约技术以应对这一挑战。我们首先提出了14条归约规则,这些规则可在严格的最优性保证下对输入图进行简化。随后,我们提出了一种归约驱动局部搜索(RLS)算法,该算法以计算高效的方式将这些归约规则集成到预处理、初始解生成及局部搜索组件中。RLS算法在来自不同应用场景的278个图上进行了实证评估。结果表明,RLS算法具有很强的竞争力——对于大多数图,相较于其他已知求解器,它能获得显著更优的解,并且能有效处理边数超过2.6亿的图,而所有其他已知方法在此类任务上均告失败。分析还表明,数据归约在实现此等竞争性能中起到了关键作用。