With the rapid increase of published open datasets, it is crucial to support the open data progress in smart cities while considering the open data quality. In the Czech Republic, and its National Open Data Catalogue (NODC), the open datasets are usually evaluated based on their metadata only, while leaving the content and the adherence to the recommended data structure to the sole responsibility of the data providers. The interoperability of open datasets remains unknown. This paper therefore aims to propose a novel content-aware quality evaluation framework that assesses the quality of open datasets based on five data quality dimensions. With the proposed framework, we provide a fundamental view on the interoperability-oriented data quality of Czech open datasets, which are published in NODC. Our evaluations find that domain-specific open data quality assessments are able to detect data quality issues beyond traditional heuristics used for determining Czech open data quality, increase their interoperability, and thus increase their potential to bring value for the society. The findings of this research are beneficial not only for the case of the Czech Republic, but also can be applied in other countries that intend to enhance their open data quality evaluation processes.
翻译:在捷克共和国及其国家开放数据目录(NODC)中,开放数据集通常仅根据其元数据进行评估,而将内容和对建议的数据结构的遵守情况留给数据提供者单独负责。开放数据集的互操作性仍然不为人所知。因此,本文件旨在提出一个新的内容认知质量评价框架,根据五个数据质量层面评估开放数据集的质量。根据拟议框架,我们对捷克开放数据集的互操作性数据质量提供了基本看法,这些数据集公布在NODC中。我们的评价发现,特定领域开放数据质量评估能够发现数据质量问题,超越用于确定捷克开放数据质量的传统超常性,提高互操作性,从而增加其为社会带来价值的潜力。这一研究结果不仅对捷克共和国有利,而且可以应用于打算加强其公开数据质量评价进程的其他国家。</s>