This paper develops a sparsity-inducing version of Bayesian Causal Forests, a recently proposed nonparametric causal regression model that employs Bayesian Additive Regression Trees and is specifically designed to estimate heterogeneous treatment effects using observational data. The sparsity-inducing component we introduce is motivated by empirical studies where not all the available covariates are relevant, leading to different degrees of sparsity underlying the surfaces of interest in the estimation of individual treatment effects. The extended version presented in this work, which we name Shrinkage Bayesian Causal Forest, is equipped with an additional pair of priors allowing the model to adjust the weight of each covariate through the corresponding number of splits in the tree ensemble. These priors improve the model's adaptability to sparse data generating processes and allow to perform fully Bayesian feature shrinkage in a framework for treatment effects estimation, and thus to uncover the moderating factors driving heterogeneity. In addition, the method allows prior knowledge about the relevant confounding covariates and the relative magnitude of their impact on the outcome to be incorporated in the model. We illustrate the performance of our method in simulated studies, in comparison to Bayesian Causal Forest and other state-of-the-art models, to demonstrate how it scales up with an increasing number of covariates and how it handles strongly confounded scenarios. Finally, we also provide an example of application using real-world data.
翻译:本文开发了一种无孔不入的Bayesian Causal Forest, 这是一种最近提出的非参数性因果回归模型,使用Bayesian Additive Regression 树,专门设计用于利用观测数据估计不同处理效果。我们引入的偏振性诱导部分是由经验研究推动的,因为并非所有可用的共变体都具有相关性,导致在估计个人治疗效果方面感兴趣的表面下层存在不同程度的宽度。本文中介绍的扩展版,我们称之为Shrinkage Bayesian Causal Forest, 配有另外一对前一款,使模型能够通过树群中相应数量的拆分来调整每种共变数的重量。这些前文改进了模型的适应性,使该模型在治疗效果估计框架中能够充分演化,从而发现驱动异性变异性的调因素。此外,该方法还允许事先了解相关变异性以及这些变异性及其对结果的相对规模,使模型能够通过树团图中的相应分数来调整每个变形体的重量。这些变异性模型是如何在模拟模型中进行模拟的。我们在模拟模型中的模型中如何进行模拟的模型和模拟的模型中进行模拟的。