We propose simple estimators for mediation analysis and dynamic treatment effects over short horizons, which preserve the nonlinearity, dependence, and effect modification of identification theory. We allow treatments, mediators, and covariates to be discrete or continuous in general spaces. Across this broad variety of data settings, the estimators have closed form solutions in terms of kernel matrix operations due to our algorithmic innovation: sequential mean embedding of the mediator and covariate conditional distributions given a hypothetical treatment sequence. The simple estimators have strong guarantees. For the continuous treatment case, we prove uniform consistency with finite sample rates that match the minimax optimal rate for standard kernel ridge regression. For the discrete treatment case, we prove $n^{-1/2}$ consistency, finite sample Gaussian approximation, and semiparametric efficiency. We extend the analysis to incremental effects and counterfactual distributions, identifying and estimating new causal estimands. In nonlinear simulations with many covariates, we demonstrate state-of-the-art performance. We estimate mediated and dynamic treatment effects of the US Job Corps program for disadvantaged youth, and share a cleaned data set that may serve as a benchmark in future work.
翻译:我们提出短期调解分析和动态处理效果的简单估计值,以保持识别理论的非线性、依赖性和效果的修改。我们允许治疗、调停和共变在一般空间中是离散的或连续的。在数据设置的这一广泛多样性中,由于我们的算法创新,估计值在内核矩阵操作方面有封闭式的解决办法:按顺序平均嵌入调解人,根据假设的治疗顺序确定共同有条件分布。简单估计值有强有力的保证。对于持续治疗案例,我们证明与符合标准内核脊脊回归最低最佳比率的有限抽样率一致。对于离散治疗案例,我们证明美元/升/升/2美元的一致性、有限的抽样高斯近似值和半偏差效率。我们把分析扩大到递增效应和反事实分布,确定和估计新的因果关系估计值。在与许多同级的非线性模拟中,我们展示了最新的业绩。我们估计了美国职业团未来工作基准项目中的介质和动态处理效果,为劣势青年提供清洁的数据。