Wikidata is the largest general-interest knowledge base that is openly available. It is collaboratively edited by thousands of volunteer editors and has thus evolved considerably since its inception in 2012. In this paper, we present Wikidated 1.0, a dataset of Wikidata's full revision history, which encodes changes between Wikidata revisions as sets of deletions and additions of RDF triples. To the best of our knowledge, it constitutes the first large dataset of an evolving knowledge graph, a recently emerging research subject in the Semantic Web community. We introduce the methodology for generating Wikidated 1.0 from dumps of Wikidata, discuss its implementation and limitations, and present statistical characteristics of the dataset.
翻译:维基数据是公开提供的最大普通利益知识库,由数千名自愿编辑协作编辑,自2012年启动以来发生了很大变化。 在本文中,我们介绍了维基数据完整修订史的数据集维基数据1.0,其中将维基数据修订作为删除和添加RDF三联的数据集进行编码。据我们所知,它是一个不断演变的知识图的第一个大数据集,这是语义网络界最近出现的一个研究课题。我们引入了从维基数据堆放处生成维基数据1.0的维基数据方法,讨论其实施和局限性,并介绍数据集的统计特征。