We describe a simple approach for combining an unbiased and a (possibly) biased estimator, and demonstrate its robustness to bias: estimate the error and cross-correlation of each estimator, and use these to construct a weighted combination that minimizes mean-squared error (MSE). Theoretically, we demonstrate that for any amount of (unknown) bias, the MSE of the resulting estimator is bounded by a small multiple of the MSE of the unbiased estimator. In simulation, we demonstrate that when the bias is sufficiently small, this estimator still yields notable improvements in MSE, and that as the bias increases without bound, the MSE of this estimator approaches that of the unbiased estimator. This approach applies to a range of problems in causal inference where combinations of unbiased and biased estimators arise. When small-scale experimental data is available, estimates of causal effects are unbiased under minimal assumptions, but may have high variance. Other data sources (such as observational data) may provide additional information about the causal effect, but potentially introduce biases. Estimators incorporating these data can be arbitrarily biased when the needed assumptions are violated. As a result, naive combinations of estimators can have arbitrarily poor performance. We show how to apply the proposed approach in these settings, and benchmark its performance in simulation against recent proposals for combining observational and experimental estimators. Here, we demonstrate that this approach shows improvement over the experimental estimator for a larger range of biases than alternative approaches.
翻译:我们描述一个简单的方法,将一个不偏不倚和(可能)偏向的估算值结合起来,并展示其稳健性,以显示偏向:估计每个估算值的错误和交叉关系,并用这些方法构建一个加权组合,以尽量减少平均偏差。理论上,我们证明,对于任何程度的(已知的)偏差(MSE),由此产生的估算值的MSE都与一个小数的(不偏向的)估算值的MSE相连接。在模拟中,我们证明,当偏向的估算值足够小时,该估算值的偏向性仍然会给MSE带来显著的改善,而随着偏向的增加,这一估算值的MSE将产生显著的改善,而这种偏向性的估算值则会增加。当提出偏向性和偏向性的估算值时,这些估算值将适用于一系列因果交错的问题。当获得小规模的实验数据时,对因果关系的估算值在最低假设下是公正的,但可能有很大的差异。其他数据来源(例如观察数据)可能提供有关因果关系的更多信息,但是当偏向偏向近期的偏向性环境展示时,这种偏向性假设的偏向性假设。