This article introduces a methodology to perform live time-traversal queries on RDF datasets and software based on this procedure. It offers a solution to manage the provenance and change-tracking of entities described using RDF. Although these two aspects are crucial factors in ensuring verifiability and trust, some of the most prominent knowledge bases - including DBpedia, Wikidata, Yago, and the Dynamic Linked Data Observatory - do not support time-agnostic SPARQL queries, i.e. queries across the various statuses an entity may have assumed in time. The OpenCitations Data Model (OCDM) describes one possible way to track provenance and entities' changes in RDF datasets, and it allows restoring an entity to a specific status in time (i.e. a snapshot) by applying SPARQL update queries. The methodology and library presented in this article are based on the rationale introduced in the OCDM. We also develop benchmarks proving that such a procedure is efficient for specific queries and less efficient for others. To date, as far as we know, our library is the only software supporting all the time-related retrieval functionalities without pre-indexing data.
翻译:本文介绍了基于此程序对RDF数据集和软件进行实时实时时间跨查询的方法,为管理使用RDF的各实体的出处和变更跟踪提供了解决办法,尽管这两个方面是确保可核查和信任的关键因素,但一些最突出的知识基础,包括DBpedia、Wikigata、Yago和动态链接数据观测站,并不支持时间-不可知性 SPARQL查询,即一个实体在不同地位上可能及时获得的查询。开放数据模型(OCDM)描述了跟踪出处和实体在RDF数据集中的变化的一种可能的方法,它通过应用SPARQL更新查询,使一个实体恢复到特定的时间状态(即快照)。本文章中介绍的方法和图书馆基于OCDM中引入的理由。我们还制定了基准,证明这种程序对于特定查询是有效的,对其他实体来说效率较低。据我们所知,我们的图书馆是唯一支持所有时间相关功能的软件,无需索引前的数据检索。