Quasi-experimental causal inference methods have become central in empirical operations management (OM) for guiding managerial decisions. Among these, empiricists utilize the Difference-in-Differences (DiD) estimator, which relies on the parallel trends assumption. To improve its plausibility, researchers often match treated and control units before applying DiD, with the intuition that matched groups are more likely to evolve similarly absent treatment. Existing work that analyze this practice, however, has focused solely on bias. We complement and fill an important gap by analyzing the full bias-variance tradeoff. Under a linear structural model with unobserved time-varying confounders, we show that variance results contrast with established bias insights: matching on observed covariates prior to DiD is not always recommended over the classic (unmatched) DiD due to a sample size tradeoff; furthermore, matching additionally on pre-treatment outcomes is always beneficial as such tradeoff no longer exists once matching is performed. We therefore advocate mean squared error (MSE) as a final metric and give practitioner-friendly guidelines with theoretical guarantees on when (and on what variables) they should match on. We apply these insights to a recent study on how the introduction of monetary incentives by a knowledge-sharing platform affects its general engagement and show that the authors' matching choice prior to DiD was both warranted and critical. In particular, we provide new managerial insights that after a full bias correction, their estimated effect with matching still remains statistically significant, demonstrating that the chosen matching-DiD approach is sufficiently robust to address managerial concerns over violations of parallel trends.
翻译:准实验因果推断方法已成为实证运营管理领域指导管理决策的核心工具。其中,研究者广泛采用依赖平行趋势假设的双重差分估计量。为提高该假设的合理性,学者常在应用双重差分前对处理组与对照组进行匹配,其直觉在于匹配后的群体在无干预情况下更可能呈现相似演变趋势。然而,现有分析该实践的研究仅聚焦于偏差维度。本文通过系统分析完整的偏差-方差权衡,填补了这一重要研究空白。在线性结构模型框架下考虑未观测时变混杂因素时,我们发现方差结果与既有偏差结论形成鲜明对比:由于样本量权衡的存在,在双重差分前基于观测协变量进行匹配并非总是优于经典(未匹配)双重差分;而额外匹配预处理结果则始终具有增益效应,因为匹配后此类权衡将不复存在。为此,我们主张采用均方误差作为最终评估指标,并提供具有理论保证的实践指南,明确何时(及基于何种变量)应进行匹配。我们将这些见解应用于近期关于知识共享平台引入货币激励如何影响用户参与度的研究,证明原作者在双重差分前采用的匹配策略不仅合理且至关重要。特别地,我们通过全偏差校正后匹配估计量仍保持统计显著性这一新发现,揭示了所选匹配-双重差分方法具有充分稳健性,能够有效应对平行趋势假设违背引发的管理决策顾虑。