This paper presents an approach for metadata reconciliation, curation and linking for Open Governamental Data Portals (ODPs). ODPs have been lately the standard solution for governments willing to put their public data available for the society. Portal managers use several types of metadata to organize the datasets, one of the most important ones being the tags. However, the tagging process is subject to many problems, such as synonyms, ambiguity or incoherence, among others. As our empiric analysis of ODPs shows, these issues are currently prevalent in most ODPs and effectively hinders the reuse of Open Data. In order to address these problems, we develop and implement an approach for tag reconciliation in Open Data Portals, encompassing local actions related to individual portals, and global actions for adding a semantic metadata layer above individual portals. The local part aims to enhance the quality of tags in a single portal, and the global part is meant to interlink ODPs by establishing relations between tags.
翻译:本文为开放政府数据门户(ODPs)的元数据调节、整理和链接提供了一种方法。 ODPs最近成为愿意向社会提供其公共数据的政府的标准解决办法。门户管理者使用几种类型的元数据来组织数据集,其中最重要的一种是标签。然而,标记过程遇到了许多问题,例如同义词、模糊或不一致等等。正如我们对ODPs的民意分析所显示的那样,这些问题目前在大多数ODPs中很普遍,并有效地阻碍了开放数据的再利用。为了解决这些问题,我们制定并执行一项在开放数据门户中标记协调的方法,其中包括与单个门户有关的当地行动,以及在单个门户上添加语义元数据层的全球行动。当地部分的目的是提高单一门户标签的质量,而全球部分则旨在通过建立标签之间的关系将ODPs连接起来。