Causal inference involves the disentanglement of effects due to a treatment variable from those of confounders, observed as covariates or not. Since one outcome is ever observed at a time, the problem turns into one of predicting counterfactuals on every individual in the dataset. Observational studies complicate this endeavor by permitting dependencies between the treatment and other variables in the sample. If the covariates influence the propensity of treatment, then one suffers from covariate shift. Should the outcome and the treatment be affected by another variable even after accounting for the covariates, there is also hidden confounding. That is immeasurable by definition. Rather, one must study the worst possible consequences of bounded levels of hidden confounding on downstream decision-making. We explore this problem in the case of continuous treatments. We develop a framework to compute ignorance intervals on the partially identified dose-response curves, which enable us to quantify the susceptibility of our inference to hidden confounders. Our method is supported by simulations as well as empirical tests based on two observational studies.
翻译:因果关系的推论涉及因治疗变异而产生的效果的分解,而治疗变异则与混杂者的变异,作为共变或非共变观察到的变异。由于一次观察到一个结果,问题就变成了对数据集中每个人的反事实作出预测的问题。观察研究通过允许治疗与抽样中其他变异之间的依赖性使这一努力复杂化。如果共变影响治疗的倾向性,那么就会发生共变变化。如果结果和治疗受到另一个变异的影响,即使计算了共变后,也会隐藏混杂。这在定义上是无法衡量的。相反,我们必须研究下游决策中隐藏混杂程度的相近可能带来的最坏后果。我们在持续治疗的情况下探讨这一问题。我们开发了一个框架,对部分识别的剂量反应曲线进行计算无知间隔,从而使我们能够量化我们对隐蔽的推断的易感性。我们的方法得到两次观察研究的模拟和实证试验的支持。