Recent advances have shown the success of using reinforcement learning and search to solve NP-hard graph-related tasks, such as Traveling Salesman Optimization, Graph Edit Distance computation, etc. However, it remains unclear how one can efficiently and accurately detect the occurrences of a small query graph in a large target graph, which is a core operation in graph database search, biomedical analysis, social group finding, etc. This task is called Subgraph Matching which essentially performs subgraph isomorphism check between a query graph and a large target graph. One promising approach to this classical problem is the "learning-to-search" paradigm, where a reinforcement learning (RL) agent is designed with a learned policy to guide a search algorithm to quickly find the solution without any solved instances for supervision. However, for the specific task of Subgraph Matching, though the query graph is usually small given by the user as input, the target graph is often orders-of-magnitude larger. It poses challenges to the neural network design and can lead to solution and reward sparsity. In this paper, we propose N-BLS with two innovations to tackle the challenges: (1) A novel encoder-decoder neural network architecture to dynamically compute the matching information between the query and the target graphs at each search state; (2) A Monte Carlo Tree Search enhanced bi-level search framework for training the policy and value networks. Experiments on five large real-world target graphs show that N-BLS can significantly improve the subgraph matching performance.
翻译:最近的进展表明,使用强化学习和搜索成功解决了NP硬图形相关任务,例如旅行销售员优化、图表编辑远程计算等。然而,目前还不清楚如何在大目标图形中高效准确地检测小查询图的发生,该大目标图是图形数据库搜索、生物医学分析、社会团体发现等的核心操作。该任务被称为子图匹配,主要在查询图表和大目标图之间进行亚形检查。对于这一古老问题的一种有希望的方法是“从学习到搜索”模式,在这个模式中,一个强化学习到搜索(RL)代理器与一项学习的政策一起设计,以指导搜索算法快速找到解决方案,而无需解决监管的事例。然而,对于子图匹配的具体任务,尽管用户通常以小于输入方式给出的查询图,但目标图往往更小,对神经网络设计构成挑战,并可以导致解决方案和奖赏。在本文中,我们建议N-BLS将两次创新用于应对挑战: (1) 每一个新搜索数据库搜索网络的高级搜索引擎,以显示动态双轨图的高级搜索结构。