Causal inference plays an important role in under standing the underlying mechanisation of the data generation process across various domains. It is challenging to estimate the average causal effect and individual causal effects from observational data with high-dimensional covariates due to the curse of dimension and the problem of data sufficiency. The existing matching methods can not effectively estimate individual causal effect or solve the problem of dimension curse in causal inference. To address this challenge, in this work, we prove that the reduced set by sufficient dimension reduction (SDR) is a balance score for confounding adjustment. Under the theorem, we propose to use an SDR method to obtain a reduced representation set of the original covariates and then the reduced set is used for the matching method. In detail, a non-parametric model is used to learn such a reduced set and to avoid model specification errors. The experimental results on real-world datasets show that the proposed method outperforms the compared matching methods. Moreover, we conduct an experiment analysis and the results demonstrate that the reduced representation is enough to balance the imbalance between the treatment group and control group individuals.
翻译:构造推论在维持不同领域数据生成过程的基本机械化方面起着重要作用;由于尺寸的诅咒和数据充足性问题,估算以高维共变的观测数据的平均因果关系效应和个别因果关系具有挑战性;现有的匹配方法无法有效地估计个人因果关系效应或解决因果推论中因果诅咒的问题;为了应对这一挑战,我们在这项工作中证明,通过足够尺寸减低设定的减量(SDR)是调整的平衡分。在理论下,我们提议使用特别提款权方法获得原始共变体的减少代表数组,然后将减量的一组用于匹配方法。详细而言,使用非参数模型来学习这种减量组,避免示范规格错误。真实世界数据集的实验结果显示,拟议的方法比对应方法要差。此外,我们进行实验分析,结果显示,减少的代表数足以平衡治疗组和控制组个人之间的不平衡。