I consider the estimation of the average treatment effect (ATE), in a population that can be divided into $G$ groups, and such that one has unbiased and uncorrelated estimators of the conditional average treatment effect (CATE) in each group. These conditions are for instance met in stratified randomized experiments. I first assume that the outcome is homoscedastic, and that each CATE is bounded in absolute value by $B$ standard deviations of the outcome, for some known constant $B$. I derive, across all linear combinations of the CATEs' estimators, the estimator of the ATE with the lowest worst-case mean-squared error. This optimal estimator assigns a weight equal to group $g$'s share in the population to the most precisely estimated CATEs, and a weight proportional to one over the CATE's variance to the least precisely estimated CATEs. This optimal estimator is feasible: the weights only depend on known quantities. I then allow for heteroskedasticity and for positive correlations between the estimators. This latter condition is often met in differences-in-differences designs, where the CATEs are estimators of the effect of having been treated for a certain number of time periods. In that case, the optimal estimator is no longer feasible, as it depends on unknown quantities, but a feasible estimator can easily be constructed by replacing those unknown quantities by estimators.
翻译:我考虑对平均治疗效果(ATE)的估计,在可以分为G$组的人口中,平均治疗效果(ATE)可以分为G$组,而且每组人群中都有对有条件平均治疗效果(CATE)的不偏袒和不相干的估计。例如,这些条件在分层随机实验中得到满足。我首先假设结果是同质的,每个CATE的绝对值以结果的标准差以美元标准差数($B$标准差数)来约束一些已知的恒定不变的美元。我从CATE的估测者的所有线性组合中得出,ATE的估测者与最不均匀的平均差差数是没有偏差的。这个最佳估测者给人口分配的权重等于美元在最精确的CATE中所占的比重, 相对于CATE的比值比值是最小的。这个最佳估测算是可行的:重量只取决于已知的数量,然后我允许他进行偏差的估测度,而对于具有最难度的比对准性对应度,而具有最难度的比喻的比值。这些估测算的估测算者在最难的比值中,其比值是最难估测算的比值是一定的比值, 的比值是一定的比值, 的比值是比值的比值的比值为最低值是比值。