There have been increased concerns that the use of statins, one of the most commonly prescribed drugs for treating coronary artery disease, is potentially associated with the increased risk of new-onset type II diabetes (T2D). However, because existing clinical studies with limited sample sizes often suffer from selection bias issues, there is no robust evidence supporting as to whether and what kind of populations are indeed vulnerable for developing T2D after taking statins. In this case study, building on the biobank and electronic health record data in the Partner Health System, we introduce a new data analysis pipeline from a biological perspective and a novel statistical methodology that address the limitations in existing studies to: (i) systematically examine heterogeneous treatment effects of stain use on T2D risk, (ii) uncover which patient subgroup is most vulnerable to T2D after taking statins, and (iii) assess the replicability and statistical significance of the most vulnerable subgroup via bootstrap calibration. Our proposed bootstrap calibration approach delivers asymptotically sharp confidence intervals and debiased estimates for the treatment effect of the most vulnerable subgroup in the presence of possibly high-dimensional covariates. By implementing our proposed approach, we find that females with high T2D genetic risk at baseline are indeed at high risk of developing T2D due to statin use, which provides evidences to support future clinical decisions with respect to statin use.
翻译:越来越多的人担心,使用死病这一治疗冠状动脉病的最常用处方药物之一,可能与新陈定型二型糖尿病(T2D)风险增加有关。 但是,由于现有抽样规模有限的临床研究往往受到选择偏差问题的影响,因此没有确凿的证据支持在服用死病后,是否和哪种人口在开发T2D方面确实脆弱。在这项案例研究中,我们从生物角度引入了一个新的数据分析管道,并引入了一个新的统计方法,以解决现有研究中的局限性,即:(一) 系统地审查污点使用对T2D风险的多种治疗影响,(二) 发现哪个病人分组在服用死病后最易受T2D风险,(三) 评估最脆弱分组通过靴子校准的可复制性和统计意义。我们提议的靴带校准方法从伙伴保健系统的生物库和电子健康记录数据中得出了一种令人生畏的信赖间隔,并提出了一种新的统计方法,即我们发现最脆弱分组在临床2 的治疗效果,在临床研究中确实具有高水平的遗传风险。