In the absence of data from a randomized trial, researchers often aim to use observational data to draw causal inference about the effect of a treatment on a time-to-event outcome. In this context, interest often focuses on the treatment-specific survival curves; that is, the survival curves were the entire population under study to be assigned to receive the treatment or not. Under certain causal conditions, including that all confounders of the treatment-outcome relationship are observed, the treatment-specific survival can be identified with a covariate-adjusted survival function. Several estimators of this function have been proposed, including estimators based on outcome regression, inverse probability weighting, and doubly robust estimators. In this article, we propose a new cross-fitted doubly-robust estimator that incorporates data-adaptive (e.g. machine learning) estimators of the conditional survival functions. We establish conditions on the nuisance estimators under which our estimator is consistent and asymptotically linear, both pointwise and uniformly in time. We also propose a novel ensemble learner for combining multiple candidate estimators of the conditional survival estimators. Notably, our methods and results accommodate events occurring in discrete or continuous time (or both). We investigate the practical performance of our methods using numerical studies and an application to the effect of a surgical treatment to prevent metastases of parotid carcinoma on mortality.
翻译:在缺乏随机试验数据的情况下,研究人员往往试图利用观察数据来推断治疗对时间到活动结果的影响。在这方面,人们的兴趣往往集中在特定治疗的存活曲线上;也就是说,生存曲线是正在研究的全体人口被分配接受治疗;在某些因果关系条件下,包括观察到治疗结果关系的所有混杂者都观察到了有条件生存功能的估算,因此可以确定特定治疗的存活率,并设定一种共变调整生存功能。已经提议了一些对这一功能的估算者,包括基于结果回归的估测器、偏差权重重力和双倍强的估算器。在本篇文章中,我们提出了一个新的交叉搭配的双曲线估算器估算器,将数据适应(e.g.机器学习)的估测器纳入有条件生存功能的估测器。我们设定了我们估算的估算器在实际效果上是一致的,并且是线性直的。我们还提出了在时间上将我们快速和稳定性估算器的精确度应用方法结合起来。我们还提议了一个将长期性研究结果与连续性研究结果相结合的新方法。