Complex logical query answering (CLQA) is a recently emerged task of graph machine learning that goes beyond simple one-hop link prediction and solves a far more complex task of multi-hop logical reasoning over massive, potentially incomplete graphs in a latent space. The task received a significant traction in the community; numerous works expanded the field along theoretical and practical axes to tackle different types of complex queries and graph modalities with efficient systems. In this paper, we provide a holistic survey of CLQA with a detailed taxonomy studying the field from multiple angles, including graph types (modality, reasoning domain, background semantics), modeling aspects (encoder, processor, decoder), supported queries (operators, patterns, projected variables), datasets, evaluation metrics, and applications. Refining the CLQA task, we introduce the concept of Neural Graph Databases (NGDBs). Extending the idea of graph databases (graph DBs), NGDB consists of a Neural Graph Storage and a Neural Graph Engine. Inside Neural Graph Storage, we design a graph store, a feature store, and further embed information in a latent embedding store using an encoder. Given a query, Neural Query Engine learns how to perform query planning and execution in order to efficiently retrieve the correct results by interacting with the Neural Graph Storage. Compared with traditional graph DBs, NGDBs allow for a flexible and unified modeling of features in diverse modalities using the embedding store. Moreover, when the graph is incomplete, they can provide robust retrieval of answers which a normal graph DB cannot recover. Finally, we point out promising directions, unsolved problems and applications of NGDB for future research.
翻译:复杂逻辑查询回答(CLQA)是图机器学习的一项最近出现的任务,它超越了简单的单跳链接预测,并在潜在空间中解决了更复杂的多跳逻辑推理任务。该任务在学术界得到了广泛的关注,许多工作在理论和实践方面扩展了该领域,以处理不同类型的复杂查询和图形模态,并设计了高效的系统。在本文中,我们提供了一个全面的CLQA综述,使用详细的分类法从多个角度研究该领域,包括图形类型(模态、推理领域、背景语义)、建模方面(编码器、处理器、解码器)、支持的查询(运算符、模式、变量)、数据集、评估指标和应用。在优化CLQA任务的基础上,我们引入了神经图数据库(NGDB)的概念。扩展图数据库(Graph DBs)的思想,NGDB由神经图存储和神经图引擎组成。在神经图存储中,我们设计了图存储、特征存储,并利用编码器将信息嵌入到潜在的嵌入存储中。给定查询后,神经查询引擎学习如何执行查询规划和执行,以便通过与神经图存储进行交互来有效地检索正确的结果。与传统图数据库相比,NGDB可以使用嵌入存储灵活而统一地建模不同形式的特征,并在图形不完全时提供稳健的答案检索,而传统图数据库无法恢复答案。最后,我们指出了NGDB未来研究的有前途的方向、未解决的问题和应用。