In this paper, we develop a general framework based on the Transformer architecture to address a variety of challenging treatment effect estimation (TEE) problems. Our methods are applicable both when covariates are tabular and when they consist of sequences (e.g., in text), and can handle discrete, continuous, structured, or dosage-associated treatments. While Transformers have already emerged as dominant methods for diverse domains, including natural language and computer vision, our experiments with Transformers as Treatment Effect Estimators (TransTEE) demonstrate that these inductive biases are also effective on the sorts of estimation problems and datasets that arise in research aimed at estimating causal effects. Moreover, we propose a propensity score network that is trained with TransTEE in an adversarial manner to promote independence between covariates and treatments to further address selection bias. Through extensive experiments, we show that TransTEE significantly outperforms competitive baselines with greater parameter efficiency over a wide range of benchmarks and settings.
翻译:在本文中,我们根据变异器结构制定了一个总体框架,以解决各种具有挑战性的处理效果估计(TEE)问题。当共变器以表格形式列出时,当共变器由序列组成(如文本)时,我们的方法都适用,并且能够处理离散、连续、结构化或剂量相关处理;虽然变异器已经作为包括自然语言和计算机视觉在内的不同领域的主导方法出现,但我们与变异器作为治疗效果模拟器(TransTEE)的实验表明,这些进化偏差对于旨在估计因果关系的研究中出现的估计问题和数据集也有效。此外,我们提议了一个以对抗方式与TransTEE培训的常态评分网络,以促进共变器和治疗之间的独立性,从而进一步解决选择偏差。我们通过广泛的实验,表明,TransTEE在一系列基准和环境中大大超越了具有更高参数效率的竞争基线。