We present a practical guide for the analysis of regression discontinuity (RD) designs in biomedical contexts. We begin by introducing key concepts, assumptions, and estimands within both the continuity-based framework and the local randomization framework. We then discuss modern estimation and inference methods within both frameworks, including approaches for bandwidth or local neighborhood selection, optimal treatment effect point estimation, and robust bias-corrected inference methods for uncertainty quantification. We also overview empirical falsification tests that can be used to support key assumptions. Our discussion focuses on two particular features that are relevant in biomedical research: (i) fuzzy RD designs, which often arise when therapeutic treatments are based on clinical guidelines, but patients with scores near the cutoff are treated contrary to the assignment rule; and (ii) RD designs with discrete scores, which are ubiquitous in biomedical applications. We illustrate our discussion with three empirical applications: the effect CD4 guidelines for anti-retroviral therapy on retention of HIV patients in South Africa, the effect of genetic guidelines for chemotherapy on breast cancer recurrence in the United States, and the effects of age-based patient cost-sharing on healthcare utilization in Taiwan. Complete replication materials employing publicly available statistical software in Python, R and Stata are provided, offering researchers all necessary tools to conduct an RD analysis.
翻译:我们在生物医学背景下为分析回归不连续(RD)设计提供了实用指南。我们首先在连续性框架和地方随机化框架内引入关键概念、假设和估计,然后在两个框架内讨论现代估算和推断方法,包括带宽或本地邻居选择方法、最佳治疗效果点估计以及稳健的、纠正偏差的推断方法,以量化不确定性。我们还概述了可用于支持关键假设的经验性伪造测试。我们的讨论侧重于生物医学研究中相关的两个具体特征:(一) 模糊的RD设计,当治疗治疗以临床准则为基础,但接近截断点的病人治疗违背任务规则时,经常出现模糊的RD设计;(二) RD设计,分数在生物医学应用中普遍存在。我们用三种经验性应用来说明我们的讨论:CD4抗逆转录病毒疗法准则对留住南非艾滋病毒病人的影响,美国乳癌复发基因治疗准则的影响,以及基于年龄的病人费用共享对台湾健康保健利用情况进行必要分析的影响。我们用三种经验性应用来说明我们的讨论。我们用CD4反逆转录疗法准则向台湾的统计数据库提供所有必要的复制材料。