Today's international corporations such as BASF, a leading company in the crop protection industry, produce and consume more and more data that are often fragmented and accessible through Web APIs. In addition, part of the proprietary and public data of BASF's interest are stored in triple stores and accessible with the SPARQL query language. Homogenizing the data access modes and the underlying semantics of the data without modifying or replicating the original data sources become important requirements to achieve data integration and interoperability. In this work, we propose a federated data integration architecture within an industrial setup, that relies on an ontology-based data access method. Our performance evaluation in terms of query response time showed that most queries can be answered in under 1 second.
翻译:今天的国际公司,如作物保护行业的领先公司BASF, 生产并消费越来越多的数据,这些数据往往分散,通过网络API获取。此外,BASF利益的一部分专有和公共数据储存在三家商店中,用SPARQL查询语言提供。在不修改或复制原始数据来源的情况下对数据访问模式和数据的基本语义进行同质化,成为实现数据整合和互操作性的重要要求。在这项工作中,我们提议在工业结构中建立一个联合的数据整合结构,依靠基于理论的数据访问方法。我们在问答时间方面的业绩评估显示,大多数查询可以在不到一秒的时间里回答。