Many real-world processes are trajectories that may be regarded as continuous-time "functional data". Examples include patients' biomarker concentrations, environmental pollutant levels, and prices of stocks. Corresponding advances in data collection have yielded near continuous-time measurements, from e.g. physiological monitors, wearable digital devices, and environmental sensors. Statistical methodology for estimating the causal effect of a time-varying treatment, measured discretely in time, is well developed. But discrete-time methods like the g-formula, structural nested models, and marginal structural models do not generalize easily to continuous time, due to the entanglement of uncountably infinite variables. Moreover, researchers have shown that the choice of discretization time scale can seriously affect the quality of causal inferences about the effects of an intervention. In this paper, we establish causal identification results for continuous-time treatment-outcome relationships for general cadlag stochastic processes under continuous-time confounding, through orthogonalization and weighting. We use three concrete running examples to demonstrate the plausibility of our identification assumptions, as well as their connections to the discrete-time g methods literature.
翻译:许多现实世界过程是可被视为连续时间“功能数据”的轨迹。例子包括病人的生物标志浓度、环境污染物水平和库存价格。数据收集的相应进展已经产生了几乎连续时间的测量结果,例如生理监测器、可磨损的数字装置和环境传感器。估计时间变化治疗的因果影响的统计方法已经很完善。但是,诸如 g-形态、结构嵌套模型和边缘结构模型等离散时间方法,由于不可估量的无限变数的缠绕,不易概括为持续时间。此外,研究人员已经表明,选择离散时间尺度可以严重影响对干预效果的因果关系推断的质量。在本文中,我们为一般炉渣在连续时间折叠、通过或分解和加权情况下的连续治疗-结果关系确定了因果关系结果。我们用三个具体运行的实例来证明我们的识别假设的可辨性,以及它们与离散文献的关联性。