Inferring causal relationships from observational data is rarely straightforward, but the problem is especially difficult in high dimensions. For these applications, causal discovery algorithms typically require parametric restrictions or extreme sparsity constraints. We relax these assumptions and focus on an important but more specialized problem, namely recovering a directed acyclic subgraph of variables known to be causally descended from some (possibly large) set of confounding covariates, i.e. a $\textit{confounder blanket}$. This is useful in many settings, for example when studying a dynamic biomolecular subsystem with genetic data providing causally relevant background information. Under a structural assumption that, we argue, must be satisfied in practice if informative answers are to be found, our method accommodates graphs of low or high sparsity while maintaining polynomial time complexity. We derive a sound and complete algorithm for identifying causal relationships under these conditions and implement testing procedures with provable error control for linear and nonlinear systems. We demonstrate our approach on a range of simulation settings.
翻译:从观测数据中推断因果关系是很少简单明了的,但问题在高维方面特别困难。对于这些应用,因果发现算法通常需要参数限制或极端宽度限制。我们放松这些假设,侧重于一个重要但更为专门的问题,即从某些(可能大)混杂的共变体中回收已知因果性下降的、定向的单向分解变数子集(即$\ textit{confounder blanter}$),这在许多环境中是有用的,例如在研究遗传数据提供因果相关背景资料的动态生物分子子系统时。我们认为,如果找到信息性答案,我们的方法必须在实践中满足这一结构性假设,即我们的方法在保持多时复杂性的同时,能够容纳低或高聚度的图象。我们从这些条件下找出因果关系的正确和完整的算法,并在线性和非线性系统中采用可辨误控的测试程序。我们在一系列模拟环境中展示了我们的方法。