Data Integration of heterogeneous data sources relies either on periodically transferring large amounts of data to a physical Data Warehouse or retrieving data from the sources on request only. The latter results in the creation of what is referred to as a virtual Data Warehouse, which is preferable when the use of the latest data is paramount. However, the downside is that it adds network traffic and suffers from performance degradation when the amount of data is high. In this paper, we propose the use of a readCheck validator to ensure the timeliness of the queried data and reduced data traffic. It is further shown that the readCheck allows transactions to update data in the data sources obeying full Atomicity, Consistency, Isolation, and Durability (ACID) properties.
翻译:不同数据来源的数据整合取决于将大量数据定期转入实物数据仓库,或仅根据请求从实际数据仓库获取数据,后者的结果是创建了所谓的虚拟数据仓库,在使用最新数据最为重要的情况下更可取;然而,其缺点是,它增加了网络流量,在数据数量高时出现性能退化;在本文件中,我们提议使用阅读检查验证器,以确保查询数据的及时性,减少数据流量;还进一步表明,阅读检查允许交易更新数据来源中的数据,使其符合完全原子性、一致性、隔离性和可变性(ACID)特性。