Automatic knowledge graph construction aims to manufacture structured human knowledge. To this end, much effort has historically been spent extracting informative fact patterns from different data sources. However, more recently, research interest has shifted to acquiring conceptualized structured knowledge beyond informative data. In addition, researchers have also been exploring new ways of handling sophisticated construction tasks in diversified scenarios. Thus, there is a demand for a systematic review of paradigms to organize knowledge structures beyond data-level mentions. To meet this demand, we comprehensively survey more than 300 methods to summarize the latest developments in knowledge graph construction. A knowledge graph is built in three steps: knowledge acquisition, knowledge refinement, and knowledge evolution. The processes of knowledge acquisition are reviewed in detail, including obtaining entities with fine-grained types and their conceptual linkages to knowledge graphs; resolving coreferences; and extracting entity relationships in complex scenarios. The survey covers models for knowledge refinement, including knowledge graph completion, and knowledge fusion. Methods to handle knowledge evolution are also systematically presented, including condition knowledge acquisition, condition knowledge graph completion, and knowledge dynamic. We present the paradigms to compare the distinction among these methods along the axis of the data environment, motivation, and architecture. Additionally, we also provide briefs on accessible resources that can help readers to develop practical knowledge graph systems. The survey concludes with discussions on the challenges and possible directions for future exploration.
翻译:为了达到这一目的,我们历来花费大量精力从不同的数据来源中提取信息事实模式,然而,最近,研究兴趣已转向获得超越信息数据的概念化结构化知识;此外,研究人员还探索了在多种设想中处理复杂建筑任务的新方法;因此,需要系统地审查各种模式,以组织数据级以外的知识结构;为了满足这一需求,我们全面调查了300多种方法,以总结知识图建设的最新发展情况;一个知识图以三个步骤建立:知识获取、知识完善和知识演变;对知识获取过程进行了详细审查,包括获取具有精细类型的实体及其与知识图表的概念联系;解决共同参照问题;在复杂设想中提取实体关系;调查涵盖知识改进模式,包括知识图的完成和知识融合;为了满足这一需求,我们还系统地介绍了处理知识演变的方法,包括条件知识获取、条件知识图的完成和知识动态。我们介绍了这些方法与数据环境、动力和知识演变的轴心相比,对这些模式进行了比较;对知识获取过程进行了详细审查,包括获得与知识图表有细微种类的实体;以及它们的概念联系;解决共同参照的参照的图表和结构;我们还就可获取性研究提供了可能了解的图表提供的资源。