Popular guidance on observational data analysis states that outcomes should be blinded when determining matching criteria or propensity scores. Such a blinding is informally said to maintain the "objectivity" of the analysis, and to prevent analysts from fishing for positive results by exploiting chance imbalances. Contrary to this notion, we show that outcome blinding is not a sufficient safeguard against fishing. Blinded and unblinded analysts can produce bias of the same order of magnitude in cases where the outcomes can be approximately predicted from baseline covariates. We illustrate this vulnerability with a combination of analytical results and simulations. Finally, to show that outcome blinding is not necessary to prevent bias, we outline an alternative sample partitioning procedure for estimating the average treatment effect on the controls, or the average treatment effect on the treated. This procedure uses all of the the outcome data from all partitions in the final analysis step, but does not require the analysis to not be fully prespecified.
翻译:关于观测数据分析的大众指导指出,在确定匹配标准或倾向分数时,结果应当被蒙蔽。这种盲点被非正式地说保持分析的“客观性”并阻止分析师通过利用机会不平衡来捕捉积极结果。与这一概念相反,我们表明,结果盲点不足以防止捕鱼。盲点和非盲点分析师在从基线变量大致预测结果时,可产生相同程度的偏差。我们用分析结果和模拟相结合的方式来说明这种脆弱性。最后,为了证明结果盲点对于防止偏差没有必要,我们概述了一种替代的样本分割程序,用以估计对控制的平均治疗效果,或对被治疗者的平均治疗效果。这一程序使用了最后分析步骤中所有分区的结果数据,但并不要求分析不完全提前进行。