复原和因果关系统一办法,以整数线性方案编制和LP的放松限制 (A Unified Approach for Resilience and Causal Responsibility with Integer Linear Programming (ILP) and LP Relaxations)

Resilience is one of the key algorithmic problems underlying various forms of reverse data management (such as view maintenance, deletion propagation, and various interventions for fairness): What is the minimal number of tuples to delete from a database in order to remove all answers from a query? A long-open question is determining those conjunctive queries (CQs) for which this problem can be solved in guaranteed PTIME. We shed new light on this and the related problem of causal responsibility by proposing a unified Integer Linear Programming (ILP) formulation. It is unified in that it can solve both prior studied restrictions (e.g., self-join-free CQs under set semantics that allow a PTIME solution) and new cases (e.g., all CQs under set or bag semantics It is also unified in that all queries and all instances are treated with the same approach, and the algorithm is guaranteed to terminate in PTIME for the easy cases. We prove that, for all easy self-join-free CQs, the Linear Programming (LP) relaxation of our encoding is identical to the ILP solution and thus standard ILP solvers are guaranteed to return the solution in PTIME. Our approach opens up the door to new variants and new fine-grained analysis: 1) It also works under bag semantics and we give the first dichotomy result for bags semantics in the problem space. 2) We give a more fine-grained analysis of the complexity of causal responsibility. 3) We recover easy instances for generally hard queries, such as instances with read-once provenance and instances that become easy because of Functional Dependencies in the data. 4) We solve an open conjecture from PODS 2020. 5) Experiments confirm that our results indeed predict the asymptotic running times, and that our universal ILP encoding is at times even faster to solve for the PTIME cases than a prior proposed dedicated flow algorithm.

翻译：复原力是不同反向数据管理形式(例如,视图维护、删除传播和各种公平干预)背后的关键算法问题之一:从数据库中删除最小的图纸数量是多少,从数据库中删除,以便从查询中排除所有答案?一个长期的疑问是,确定在有保障的 PTIME 中可以解决这一问题的连结查询(CQs ) 。我们提出了统一的 Integer 线性程序(ILP) 配置, 从而对此及相关的因果关系问题有了新的了解。它的统一在于它既能解决以前研究过的限制(例如,在设定的内脏解调中,自join自由的CQ),又能解决以前研究过的限制(例如,在设置的内脏解析中,自joy-join CQs)和新案例(例如,在设置的STIME 解析中,所有CQQQQQs ) 的简单化解析(自制) 也能够解决这些难题,因此,在IML IML IM 的解算中,我们的新解算和标准 IL IML IML 解算中, 解算的解算过程也是正常的。

相关内容

ILP

关注 130

归纳逻辑程序设计（ILP）是机器学习的一个分支，它依赖于逻辑程序作为一种统一的表示语言来表达例子、背景知识和假设。基于一阶逻辑的ILP具有很强的表示形式，为多关系学习和数据挖掘提供了一种很好的方法。International Conference on Inductive Logic Programming系列始于1991年，是学习结构化或半结构化关系数据的首要国际论坛。最初专注于逻辑程序的归纳，多年来，它大大扩展了研究范围，并欢迎在逻辑学习、多关系数据挖掘、统计关系学习、图形和树挖掘等各个方面作出贡献，学习其他（非命题）基于逻辑的知识表示框架，探索统计学习和其他概率方法的交叉点。官网链接：https://ilp2019.org/

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日