Virtual Knowledge Graphs (VKG) constitute one of the most promising paradigms for integrating and accessing legacy data sources. A critical bottleneck in the integration process involves the definition, validation, and maintenance of mappings that link data sources to a domain ontology. To support the management of mappings throughout their entire lifecycle, we propose a comprehensive catalog of sophisticated mapping patterns that emerge when linking databases to ontologies. To do so, we build on well-established methodologies and patterns studied in data management, data analysis, and conceptual modeling. These are extended and refined through the analysis of concrete VKG benchmarks and real-world use cases, and considering the inherent impedance mismatch between data sources and ontologies. We validate our catalog on the considered VKG scenarios, showing that it covers the vast majority of patterns present therein.
翻译:虚拟知识图(VKG)是整合和获取遗留数据来源最有希望的范例之一。整合过程中的一个关键瓶颈是界定、验证和维护将数据来源与域本体学联系起来的绘图。为了支持对测绘的整个生命周期进行管理,我们建议建立一个综合的复杂绘图模式目录,在将数据库与本体学联系起来时出现。为此,我们利用了在数据管理、数据分析和概念建模方面研究的既定方法和模式。通过分析具体的VKG基准和实际世界使用案例,并考虑到数据来源与本体学之间固有的阻碍不匹配,这些方法和模式得到扩展和完善。我们验证了我们关于所考虑的VKG情景的目录,表明它涵盖了其中的绝大多数模式。