In this paper we address the problem of handling inconsistencies in tables with missing values (also called nulls) and functional dependencies. Although the traditional view is that table instances must respect all functional dependencies imposed on them, it is nevertheless relevant to develop theories about how to handle instances that violate some dependencies. Regarding missing values, we make no assumptions on their existence: a missing value exists only if it is inferred from the functional dependencies of the table. We propose a formal framework in which each tuple of a table is associated with a truth value among the following: true, false, inconsistent or unknown; and we show that our framework can be used to study important problems such as consistent query answering, table merging, and data quality measures - to mention just a few. In this paper, however, we focus mainly on consistent query answering, a problem that has received considerable attention during the last decades. The main contributions of the paper are the following: (a) we introduce a new approach to handle inconsistencies in a table with nulls and functional dependencies, (b) we give algorithms for computing all true, inconsistent and false tuples, (c) we investigate the relationship between our approach and Four-valued logic in the context of data merging, and (d) we give a novel solution to the consistent query answering problem and compare our solution to that of table repairs.
翻译:在本文中,我们处理的是处理缺少价值(也称为无效)和功能依赖性表格不一致的问题。虽然传统观点是,表格实例必须尊重对其施加的所有功能依赖性,但有必要制定如何处理违反某些依赖性的案例的理论。关于缺失价值,我们不对其存在作出假设:缺失的价值只有在从表格的功能依赖性推断得出时才存在。我们建议了一个正式框架,使表格的每一部分都与下列各点的真理价值相联系:真实、虚假、前后不一致或未知;我们表明,我们的框架可以用来研究重要问题,例如连贯一致的问答、表格合并和数据质量措施,仅举几个例子。然而,在本文件中,我们主要侧重于一致的回答问题,一个在过去几十年中受到相当重视的问题。文件的主要贡献如下:(a)我们采用一种新办法,处理表格中与无效和功能依赖性相关的不一致之处,(b)我们为计算所有真实、不一致和不准确的图调,以及数据质量措施等重要问题,例如一致的回答,(c)我们研究四种解决办法之间的逻辑关系,我们将我们的方法与不断的理算。