In causal inference, principal stratification is a framework for dealing with a posttreatment intermediate variable between a treatment and an outcome, in which the principal strata are defined by the joint potential values of the intermediate variable. Because the principal strata are not fully observable, the causal effects within them, also known as the principal causal effects, are not identifiable without additional assumptions. Several previous empirical studies leveraged auxiliary variables to improve the inference of principal causal effects. We establish a general theory for identification and estimation of the principal causal effects with auxiliary variables, which provides a solid foundation for statistical inference and more insights for model building in empirical research. In particular, we consider two commonly-used strategies for principal stratification problems: principal ignorability, and the conditional independence between the auxiliary variable and the outcome given principal strata and covariates. For these two strategies, we give non-parametric and semi-parametric identification results without modeling assumptions on the outcome. When the assumptions for neither strategies are plausible, we propose a large class of flexible parametric and semi-parametric models for identifying principal causal effects. Our theory not only establishes formal identification results of several models that have been used in previous empirical studies but also generalizes them to allow for different types of outcomes and intermediate variables.
翻译:在因果推断中,主要分层是处理治疗与结果之间的后处理中间变量的框架,其中主要阶层是由中间变量的共同潜在值界定的。由于主要阶层没有完全可见,因此其中的因果效应,也称为主要因果效应,在没有其他假设的情况下是无法辨认的。前几次经验研究利用辅助变量来改进主要因果效应的推论,以改进主要因果效应的推论。我们用辅助变量确定和估计主要因果效应的一般理论,为统计推论提供了坚实的基础,并为经验性研究的模型建设提供了更多的洞察力。特别是,我们考虑了两种常见的主要分层问题常用战略:主要可忽略性,以及辅助变量与给定的主要分层和共变数结果之间的有条件独立性。对于这两种战略,我们提供了非参数和半参数的识别结果,但没有对结果进行模型模型模型模型的建模。当对两个战略的假设都不合理时,我们建议为确定主要因果影响的模型提供大量灵活的弹性对数和半参数模型。我们的理论不仅确定了一些模型的正式识别结果,而且还允许对以前不同类型经验研究中所使用的模型进行一般的变数。