Estimating the average treatment effect (ATE) from observational data is challenging due to selection bias. Existing works mainly tackle this challenge in two ways. Some researchers propose constructing a score function that satisfies the orthogonal condition, which guarantees that the established ATE estimator is "orthogonal" to be more robust. The others explore representation learning models to achieve a balanced representation between the treated and the controlled groups. However, existing studies fail to 1) discriminate treated units from controlled ones in the representation space to avoid the over-balanced issue; 2) fully utilize the "orthogonality information". In this paper, we propose a moderately-balanced representation learning (MBRL) framework based on recent covariates balanced representation learning methods and orthogonal machine learning theory. This framework protects the representation from being over-balanced via multi-task learning. Simultaneously, MBRL incorporates the noise orthogonality information in the training and validation stages to achieve a better ATE estimation. The comprehensive experiments on benchmark and simulated datasets show the superiority and robustness of our method on treatment effect estimations compared with existing state-of-the-art methods.
翻译:根据观察数据估计平均治疗效果(ATE)由于选择偏好而具有挑战性。现有工作主要以两种方式应对这一挑战。一些研究人员建议构建一个符合正方形条件的评分功能,保证既定的ATE估计值“ortominal”更加稳健。其他研究人员探索代表性学习模式,以实现受治疗群体与受控制群体之间的均衡代表性。然而,现有研究未能(1) 将代表空间中受控制单位与受控单位区分开来,以避免过度平衡问题;(2) 充分利用“正方形信息”。在本文件中,我们提议基于近期的共变平衡代表性学习方法和或正方形机器学习理论,构建一个中等平衡的代表性学习框架。这一框架保护代表性不会通过多功能学习过度平衡。同时,MBRL在培训和验证阶段纳入噪音或多层信息,以获得更好的评估。基准和模拟数据集的全面实验表明,与现有状态方法相比,我们在治疗效果估计方法上具有优越性和稳健性。