In this paper we address the problem of handling inconsistencies in tables with missing values (or nulls) and functional dependencies. Although the traditional view is that table instances must respect all functional dependencies imposed on them, it is nevertheless relevant to develop theories about how to handle instances that violate some dependencies. The usual approach to alleviate the impact of inconsistent data on the answers to a query is to introduce the notion of repair: a repair is a minimally di?erent consistent instance and an answer is consistent if it is present in every repair. Our approach is fundamentally di?erent: we use set theoretic semantics for tuples and functional dependencies that allow us to associate each tuple with a truth value among the following: true, false, inconsistent or unknown. The users of the table can then query the set of true tuples as usual. Regarding missing values, we make no assumptions on their existence: a missing value exists only if it is inferred from the functional dependencies of the table. The main contributions of the paper are the following: (a) we introduce a new approach to handle inconsistencies in a table with nulls and functional dependencies, (b) we give algorithms for computing all true, inconsistent and false tuples, (c) we discuss how our approach relates to Belnap's four valued logic, (d) we de- scribe how our approach can be applied to the consolidation of two or more tables and (e) we discuss the relationship between our approach and that of table repairs.
翻译:在本文中,我们处理的是处理缺少值(或无效)和功能依赖性表格不一致的问题。虽然传统观点认为表格实例必须尊重对其施加的所有功能依赖性,但有必要就如何处理违反某些依赖性的事件制定理论。减轻数据不一致对查询答案的影响的通常做法是引入修复概念:修复是一个微小的差错?一致性实例,如果每个修复都存在功能依赖性,答案是一致的。我们的方法从根本上是偏差的。我们采用的方法是:我们使用术语和功能依赖性术语的理论术语,使我们能够将每个图例与以下各点之间的真实价值联系起来:真实的、虚假的、不一致的或未知的。然后,表格的用户可以像往常一样查询一套真正的图例。关于缺失的价值,我们没有对其存在做出任何假设:只有从表格的功能依赖性来推断,才存在缺失的价值。我们的文件的主要贡献如下:(a)我们采用新的方法,在表格中处理每个图案的不一致之处,让我们用无效的和功能依赖性方法来讨论(我们如何将真实的、逻辑和功能依赖性关系联系起来),我们如何使这些图表的表格与真正和成本关系联系起来(我们如何讨论)。