Randomized controlled trials (RCTs) are considered the gold standard for estimating the average treatment effect (ATE) of interventions. One use of RCTs is to study the causes of global poverty -- a subject explicitly cited in the 2019 Nobel Memorial Prize awarded to Duflo, Banerjee, and Kremer "for their experimental approach to alleviating global poverty." Because the ATE is a population summary, anti-poverty experiments often seek to unpack the effect variation around the ATE by conditioning (CATE) on tabular variables such as age and ethnicity that were measured during the RCT data collection. Although such variables are key to unpacking CATE, using only such variables may fail to capture historical, geographical, or neighborhood-specific contributors to effect variation, as tabular RCT data are often only observed near the time of the experiment. In global poverty research, when the location of the experiment units is approximately known, satellite imagery can provide a window into such factors important for understanding heterogeneity. However, there is no method that specifically enables applied researchers to analyze CATE from images. In this paper, using a deep probabilistic modeling framework, we develop such a method that estimates latent clusters of images by identifying images with similar treatment effects distributions. We also emphasize a sensitivity factor that quantifies the importance of image segments contributing to the mean effect cluster probabilities. We compare the proposed methods against alternatives in simulation; also, we show how the model works in an actual RCT, estimating the effects of an anti-poverty intervention in northern Uganda and obtaining a posterior predictive distribution over effects for the rest of the country where no experimental data was collected. We make the models available in an open-source package and discuss other applications.
翻译:使用RCT的方法之一是研究全球贫困的原因 -- -- 2019年诺贝尔纪念奖授予Duflo、Banerjee和Kremer的2019年Duflo、Banerjee和Kremer的纪念奖中明确提到这个问题,这是“其减轻全球贫困的实验方法”。由于ATE是一个人口概况,除贫实验往往试图通过调整(CATE)来解开在ATE周围的影响差异,例如,在RCT数据收集期间测量的年龄和族裔等表列变量。尽管这些变量是解开CATE的关键,但仅使用这些变量可能无法捕捉到历史、地理或特定街区的促进者来影响变异,因为表列RCT的数据往往只在试验时间附近被观察。在全球贫困研究中,当实验单位的位置大致为已知时,卫星图像可以打开一个窗口,了解这些要素对于理解异质性。然而,没有具体的方法可以让应用的研究人员从图像中分析CATE。在本文中,使用深度的预测模型模型框架,仅使用这些变量可能无法捕捉到历史、地理上的变异性效果,我们通过在比较的模型框架中,我们用这种方法来评估了一种潜在图像的图像的模型来评估一个相似的图像的模型,我们用来评估了一种稳定的模型,我们用一个稳定的图像的模型来评估了一种稳定的模型,我们用来评估了一种摄变数。