Schema and data integration have been a challenge for more than 40 years. While data warehouse technologies are quite a success story, there is still a lack of information integration methods, especially if the data sources are based on different data models or do not have a schema. Enterprise Information Integration has to deal with heterogeneous data sources and requires up-to-date high-quality information to provide a reliable basis for analysis and decision making. The paper proposes virtual integration using the Typed Graph Model to support schema mediation. The integration process first converts the structure of each source into a typed graph schema, which is then matched to the mediated schema. Mapping rules define transformations between the schemata to reconcile semantics. The mapping can be visually validated by experts. It provides indicators and rules to achieve a consistent schema mapping, which leads to high data integrity and quality.
翻译:40多年来,Schema和数据整合一直是一个挑战。虽然数据仓技术相当成功,但信息整合方法仍然缺乏,特别是如果数据源基于不同的数据模型或没有系统模式。企业信息整合必须处理各种数据源,并要求最新的高质量信息为分析和决策提供可靠的基础。文件建议使用“类型图模型”进行虚拟整合,以支持系统调解。整合过程首先将每个来源的结构转换成一个打印式图表系统图,然后与经过调解的系统图匹配。绘图规则界定了系统图之间的转换,以调和语义。制图可以由专家以视觉方式验证,提供指标和规则,以实现一致的系统制图,从而实现高数据完整性和质量。