Many network applications can be formulated as NP-hard combinatorial optimization problems of community detection (CD). Due to the NP-hardness, to balance the CD quality and efficiency remains a challenge. Most existing CD methods are transductive, which are independently optimized only for the CD on a single graph. Some of these methods use advanced machine learning techniques to obtain high-quality CD results but usually have high complexity. Other approaches use fast heuristic approximation to ensure low runtime but may suffer from quality degradation. In contrast to these transductive methods, we propose an alternative inductive community detection (ICD) method across graphs of a system or scenario to alleviate the NP-hard challenge. ICD first conducts the offline training of an adversarial dual GNN on historical graphs to capture key properties of the system. The trained model is then directly generalized to new unseen graphs for online CD without additional optimization, where a better trade-off between quality and efficiency can be achieved. ICD can also capture the permutation invariant community labels in the offline training and tackle the online CD on new graphs with non-fixed number of nodes and communities. Experiments on a set of benchmarks demonstrate that ICD can achieve a significant trade-off between quality and efficiency over various baselines.
翻译:许多网络应用程序可以作为社区探测(CD)的NP硬组合优化问题来设计。由于NP硬性,平衡CD质量和效率仍然是一项挑战。现有的CD方法大多是传输方法,仅对光盘采用独立优化的单一图形。其中一些方法使用先进的机器学习技术来获得高质量的CD结果,但通常具有很高的复杂性。其他方法使用快速超速近似近似以确保低运行时间,但可能受到质量退化的影响。与这些转导方法不同,我们提议了一种在系统图或情景图中跨图解社区诱导检测(ICD)的替代方法,以减轻NP硬挑战。ICD首先在历史图中进行对对对对立双双GNN的离线培训,以捕捉系统的关键特性。经过培训的模式随后直接推广为在线CD的新看不见的图形,而无需进一步优化,从而可以实现质量和效率之间的更好的权衡。ICD还可以在离线培训中捕获变量社区标签,并在新图上处理有非固定质量基准的光盘,可以展示各种无贸易质量的测试。