We develop a functional proportional hazards mixture cure (FPHMC) model with scalar and functional covariates measured at the baseline. The mixture cure model, useful in studying populations with a cure fraction of a particular event of interest is extended to functional data. We employ the EM algorithm and develop a semiparametric penalized spline-based approach to estimate the dynamic functional coefficients of the incidence and the latency part. The proposed method is computationally efficient and simultaneously incorporates smoothness in the estimated functional coefficients via roughness penalty. Simulation studies illustrate a satisfactory performance of the proposed method in accurately estimating the model parameters and the baseline survival function. Finally, the clinical potential of the model is demonstrated in two real data examples that incorporate rich high-dimensional biomedical signals as functional covariates measured at the baseline and constitute novel domains to apply cure survival models in contemporary medical situations. In particular, we analyze i) minute-by-minute physical activity data from the National Health and Nutrition Examination Survey (NHANES) 2003-2006 to study the association between diurnal patterns of physical activity (PA) at baseline and all cancer mortality through 2019 while adjusting for other biological factors; ii) the impact of daily functional measures of disease severity collected in the intensive care unit on post ICU recovery and mortality event. Our findings provide novel epidemiological insights into the association between daily patterns of PA and cancer mortality. Software implementation and illustration of the proposed estimation method is provided in R.
翻译:我们开发了一种用于基线处量和功能协变量的功能比例风险混合治愈(FPHMC)模型。这种混合治愈模型适用于研究特定感兴趣事件的治愈比例的人群,并扩展到功能数据。我们采用EM算法,并开发了一种半参数惩罚样条技术来估计发病和潜伏期的动态功能系数。所提出的方法在估计模型参数和基线生存函数方面表现出较高的计算效率。模拟研究说明了所提出的方法在准确估计模型参数和基线生存函数方面的满意表现。最后,我们通过两个真实数据示例来演示模型的临床潜力。这两个数据都包括基线测量的丰富的高维生物医学信号作为功能协变量,并构成了当前医学领域中将治愈生存模型应用于新颖领域的案例。特别地,我们分析了来自NHANES 2003-2006的每分钟身体活动数据,以研究基线贯穿整天的身体活动模式与2019年所有癌症死亡之间的关联,同时调整其他生物因素;另外,我们还探究了集中治疗单元收集的每日功能严重程度措施对后ICU恢复和死亡事件的影响。我们的研究结果提供了关于每日身体活动模式与癌症死亡之间关联的新颖流行病学洞见。该估计方法的软件实现及示例提供在R软件下。