The problem of how to best select variables for confounding adjustment forms one of the key challenges in the evaluation of exposure effects in observational studies, and has been the subject of vigorous recent activity in causal inference. A major drawback of routine procedures is that there is no finite sample size at which they are guaranteed to deliver exposure effect estimators and associated confidence intervals with adequate performance. In this work, we will consider this problem when inferring conditional causal hazard ratios from observational studies under the assumption of no unmeasured confounding. The major complication that we face with survival data is that the key confounding variables may not be those that explain the censoring mechanism. In this paper, we overcome this problem using a novel and simple procedure that can be implemented using off-the-shelf software for penalized Cox regression. In particular, we will propose tests of the null hypothesis that the exposure has no effect on the considered survival endpoint, which are uniformly valid under standard sparsity conditions. Simulation results show that the proposed methods yield valid inferences even when covariates are high-dimensional.
翻译:在观察研究中,如何最好地选择影响调整的变量是评估暴露效应的关键挑战之一,也是最近因果推断中积极活动的主题。例行程序的一个主要缺点是,没有保证提供暴露效应估计器和相关的信任期的有限样本规模,保证它们能以适当的性能提供暴露效应估计器和相关的信任期。在这项工作中,我们将在假设没有不测混结的情况下,从观测研究中推断出有条件的因果危害比率时考虑这一问题。我们在生存数据中面临的主要复杂因素是,关键混淆变量可能不是解释审查机制的变量。在本文中,我们用一种新颖而简单的程序克服了这一问题,可以使用现成的软件来惩罚考克斯的回归。特别是,我们将提出一个无效假设的测试,即所考虑的存活终点不会受到任何影响,在标准的宽度条件下,这种影响是统一的。模拟结果表明,拟议的方法即使在共性高度的情况下,也会产生有效的推论。