A causal query will commonly not be identifiable from observed data, in which case no estimator of the query can be contrived without further assumptions or measured variables, regardless of the amount or precision of the measurements of observed variables. However, it may still be possible to derive symbolic bounds on the query in terms of the distribution of observed variables. Bounds, numeric or symbolic, can often be more valuable than a statistical estimator derived under implausible assumptions. Symbolic bounds, however, provide a measure of uncertainty and information loss due to the lack of an identifiable estimand even in the absence of data. We develop and describe a general approach for computation of symbolic bounds and characterize a class of settings in which our method is guaranteed to provide tight valid bounds. This expands the known settings in which tight causal bounds are solutions to linear programs. We also prove that our method can provide valid and possibly informative symbolic bounds that are not guaranteed to be tight in a larger class of problems. We illustrate the use and interpretation of our algorithms in three examples in which we derive novel symbolic bounds.
翻译:通常无法从观察到的数据中辨别因果关系,在这种情况下,无论观测到的变量的测量数量或精确度如何,都无法在没有进一步假设或测量变量的情况下,推断出查询的估算者。然而,从所观察到变量的分布上,仍有可能从查询中得出象征性的界限。圆形、数字或符号往往比根据不可信的假设得出的统计估计者更有价值。但是,符号界限提供了某种程度的不确定性和信息损失,因为缺乏可识别的估计值,即使在缺乏数据的情况下也是如此。我们制定和描述一种计算符号界限的一般方法,并描述一种我们的方法保证提供严格有效界限的环境类别。这扩大了已知的环境,在这种环境中,紧因果关系是线性程序的解决办法。我们还证明,我们的方法可以提供有效和可能具有信息性的符号界限,而在更大的问题类别中无法保证这些界限是紧凑的。我们用三个例子来说明我们的算法的用法和解释,我们从中得出了新的符号界限。