When estimating a Global Average Treatment Effect (GATE) under network interference, units can have widely different relationships to the treatment depending on a combination of the structure of their network neighborhood, the structure of the interference mechanism, and how the treatment was distributed in their neighborhood. In this work, we introduce a sequential procedure to generate and select graph- and treatment-based covariates for GATE estimation under regression adjustment. We show that it is possible to simultaneously achieve low bias and considerably reduce variance with such a procedure. To tackle inferential complications caused by our feature generation and selection process, we introduce a way to construct confidence intervals based on a block bootstrap. We illustrate that our selection procedure and subsequent estimator can achieve good performance in terms of root mean squared error in several semi-synthetic experiments with Bernoulli designs, comparing favorably to an oracle estimator that takes advantage of regression adjustments for the known underlying interference structure. We apply our method to a real world experimental dataset with strong evidence of interference and demonstrate that it can estimate the GATE reasonably well without knowing the interference process a priori.
翻译:在网络干扰下估计全球平均治疗效果时,单位与治疗的关系可能大不相同,这取决于其网络周边结构、干扰机制结构的组合,以及治疗方式如何在周边的分布。在这项工作中,我们引入了一种顺序程序,为GATE在回归调整下估算生成和选择基于图形和治疗的共变体。我们表明,有可能同时达到低偏差,并大大降低与这种程序的差异。为了处理由我们特性生成和选择过程造成的推断性并发症,我们引入了一种方法,在块式靴子上建立信任间隔。我们说明,我们的选择程序和随后的估算器可以在伯尔努利设计的若干半合成实验中,在根值平均的平方差方面实现良好的性表现,与利用已知基本干扰结构的回归调整的甲骨架估算器进行有利比较。我们将我们的方法应用于一个真实的世界实验数据集,有强烈的干扰证据,并表明它可以在事先不知道干扰过程的情况下对GATE进行合理估计。