We study the problem of answering queries when (part of) the data may be sensitive and should not be leaked to the querier. Simply restricting the computation to non-sensitive part of the data may leak sensitive data through inference based on data dependencies. While inference control from data dependencies during query processing has been studied in the literature, existing solution either detect and deny queries causing leakage, or use a weak security model that only protects against exact reconstruction of the sensitive data. In this paper, we adopt a stronger security model based on full deniability that prevents any information about sensitive data to be inferred from query answers. We identify conditions under which full deniability can be achieved and develop an efficient algorithm that minimally hides non-sensitive cells during query processing to achieve full deniability. We experimentally show that our approach is practical and scales to increasing proportion of sensitive data, as well as, to increasing database size
翻译:我们研究在(部分)数据可能敏感时回答询问的问题,而不应将问题透露给querier。简单地将计算限制在数据中非敏感部分,可能通过基于数据依赖性的推论泄漏敏感数据。虽然文献中研究了在查询处理过程中对数据依赖性进行推断控制的问题,但现有解决办法要么发现并拒绝导致泄漏的查询,要么使用仅保护敏感数据准确重建的薄弱安全模型。在本文中,我们采用了基于完全可否认性的更强有力的安全模式,防止从查询答复中推断出关于敏感数据的任何信息。我们确定在何种条件下可以实现完全可否认性,并开发一种有效的算法,在查询处理过程中尽可能隐藏非敏感单元格,以达到完全可忽略性。我们实验性地表明,我们提高敏感数据比例的方法是实用的,规模是提高敏感数据的比例,以及增加数据库的规模。