Due to the lack of structure, scholarly knowledge remains hardly accessible for machines. Scholarly knowledge graphs have been proposed as a solution. Creating such a knowledge graph requires manual effort and domain experts, and is therefore time-consuming and cumbersome. In this work, we present a human-in-the-loop methodology used to build a scholarly knowledge graph leveraging literature survey articles. Survey articles often contain manually curated and high-quality tabular information that summarizes findings published in the scientific literature. Consequently, survey articles are an excellent resource for generating a scholarly knowledge graph. The presented methodology consists of five steps, in which tables and references are extracted from PDF articles, tables are formatted and finally ingested into the knowledge graph. To evaluate the methodology, 92 survey articles, containing 160 survey tables, have been imported in the graph. In total, 2,626 papers have been added to the knowledge graph using the presented methodology. The results demonstrate the feasibility of our approach, but also indicate that manual effort is required and thus underscore the important role of human experts.
翻译:由于缺乏结构,对于机器来说,学术知识仍然难以获得。学术知识图是一个解决办法。创建这种知识图需要人工和领域专家,因此耗费时间和繁琐。在这项工作中,我们提出了用于建立利用文献调查文章的学术知识图的 " 流动中的人 " 方法。调查文章往往包含人工整理和高质量的表格信息,汇总科学文献中公布的调查结果。因此,调查文章是制作学术知识图的极好资源。提出的方法包括五个步骤,其中从PDF文章中提取表格和参考资料,表格格式化并最后输入知识图中。为评估方法,在图表中输入了92篇调查文章,其中载有160个调查表。总共有2 626篇论文是使用所提出的方法在知识图中添加的。结果显示了我们的方法的可行性,但也表明需要手工工作,从而强调了人类专家的重要作用。