We propose strategies to estimate and make inference on key features of heterogeneous effects in randomized experiments. These key features include best linear predictors of the effects using machine learning proxies, average effects sorted by impact groups, and average characteristics of most and least impacted units. The approach is valid in high dimensional settings, where the effects are proxied (but not necessarily consistently estimated) by predictive and causal machine learning methods. We post-process these proxies into estimates of the key features. Our approach is generic, it can be used in conjunction with penalized methods, neural networks, random forests, boosted trees, and ensemble methods, both predictive and causal. Estimation and inference are based on repeated data splitting to avoid overfitting and achieve validity. We use quantile aggregation of the results across many potential splits, in particular taking medians of p-values and medians and other quantiles of confidence intervals. We show that quantile aggregation lowers estimation risks over a single split procedure, and establish its principal inferential properties. Finally, our analysis reveals ways to build provably better machine learning proxies through causal learning: we can use the objective functions that we develop to construct the best linear predictors of the effects, to obtain better machine learning proxies in the initial step. We illustrate the use of both inferential tools and causal learners with a randomized field experiment that evaluates a combination of nudges to stimulate demand for immunization in India.
翻译:我们提出了在随机实验中估算和推断不同效应关键特征的战略。这些关键特征包括:使用机器学习代理物对影响进行最佳线性预测,按影响组进行平均影响分类,以及受影响最大和受影响最小单位的平均特征。这种方法在高维环境中是有效的,其影响通过预测和因果机学习方法得到代理(但不一定一致估计),我们将这些代理物处理成关键特征的估计。我们的方法是通用的,它可以与惩罚方法、神经网络、随机森林、增生树和共性方法一起使用。最后,我们的分析揭示了基于反复数据分解以避免过度适应和实现有效性的估算和推断方法。我们使用了许多潜在分裂的结果的四分集,特别是采用预测值和中位数的中位,以及信任间隔的其他四分法。我们表明,定量汇总会降低对单一分裂程序的风险的组合,可以同时使用任意的森林、增殖树树和共性方法。最后,我们的分析揭示了以反复的数据分割为基础,以避免重复为基础,避免过度调整并实现有效性。我们用最佳的实验工具来改进机序学前阶段的学习。我们获得最佳的实验工具,从而更好地进行我们获得最佳的理论学前期的实验工具。