Across research disciplines, cluster randomized trials (CRTs) are commonly implemented to evaluate interventions delivered to groups of participants, such as communities and clinics. Despite advances in the design and analysis of CRTs, several challenges remain. First, there are many possible ways to specify the causal effect of interest (e.g., at the individual-level or at the cluster-level). Second, the theoretical and practical performance of common methods for CRT analysis remain poorly understood. Here, we present a general framework to formally define an array of causal effects in terms of summary measures of counterfactual outcomes. Next, we provide a comprehensive overview of well-known CRT estimators, including the t-test and generalized estimating equations (GEE), as well as less well-known methods, including augmented-GEE and targeted maximum likelihood estimation (TMLE). Using finite sample simulations, we illustrate the practical performance of these estimators for different causal effects and when, as commonly occurs, there are limited numbers of clusters of varying size. Finally, our application to data from the Preterm Birth Initiative (PTBi) study demonstrates the real-world impact of varying cluster sizes and targeting effects at the cluster-level or at the individual-level. Given its flexibility to estimate a variety of user-specified effects and ability to adaptively adjust for covariates for precision gains while maintaining Type-I error control, we conclude TMLE is a promising tool for CRT analysis.
翻译:在研究学科之间,通常实施集群随机试验(CRTs),以评价向社区和诊所等参与者群体提供的干预措施。尽管在设计和分析CRTs方面有所进展,但仍存在若干挑战。首先,有许多可能的方法可以具体说明兴趣的因果关系(例如,在个人一级或集群一级)。第二,CRT分析共同方法的理论和实际表现仍然不甚了解。在这里,我们提出了一个总体框架,以便正式确定一系列对反事实结果的总结措施的因果关系。接着,我们全面概述了众所周知的CRT估计数据,包括测试和普遍估计方程(GEEE),以及不太为人所知的方法,包括扩大GEEE和有针对性的最大可能性估计(TMLE)等。我们用有限的抽样模拟来说明这些估计者在不同因果关系影响方面的实际表现,而且通常出现数量不等的群集。最后,我们对预产倡议(PTBI)研究的数据应用了全面的CRT, 显示了C-C-C-C-C-C-C-C-C-C-C-LE-LV-LA级上不同组合规模的灵活度和定数性分析对用户级的准确性分析结果产生的实际影响。