Handling missing values plays an important role in the analysis of survival data, especially, the ones marked by cure fraction. In this paper, we discuss the properties and implementation of stochastic approximations to the expectation-maximization (EM) algorithm to obtain maximum likelihood (ML) type estimates in situations where missing data arise naturally due to right censoring and a proportion of individuals are immune to the event of interest. A flexible family of three parameter exponentiated-Weibull (EW) distributions is assumed to characterize lifetimes of the non-immune individuals as it accommodates both monotone (increasing and decreasing) and non-monotone (unimodal and bathtub) hazard functions. To evaluate the performance of the SEM algorithm, an extensive simulation study is carried out under various parameter settings. Using likelihood ratio test we also carry out model discrimination within the EW family of distributions. Furthermore, we study the robustness of the SEM algorithm with respect to outliers and algorithm starting values. Few scenarios where stochastic EM (SEM) algorithm outperforms the well-studied EM algorithm are also examined in the given context. For further demonstration, a real survival data on cutaneous melanoma is analyzed using the proposed cure rate model with EW lifetime distribution and the proposed estimation technique. Through this data, we illustrate the applicability of the likelihood ratio test towards rejecting several well-known lifetime distributions that are nested within the wider class of EW distributions.
翻译:处理缺失值在分析生存数据中起着重要作用, 特别是以治愈分数为标志的值。 在本文中, 我们讨论与预期- 最大化( EM) 危险功能相匹配的随机近似值的属性和实施。 为了评估SEM 算法的性能, 在不同参数设置下进行广泛的模拟研究, 利用可能性比测试, 我们还在EW 分布的家族中进行模型歧视。 此外, 我们研究SEM 算法在外端值和算法起始值方面的强度。 很少有人知道非模拟个人在满足单质( 增减) 和非摩托内( 单式和 浴) 危险功能时的寿命周期性近近近效值。 为了评估SEM 算法的性能, 在各种参数设置下进行广泛的模拟研究。 我们还利用可能性比值测试, 在分布时进行模型- 和算法的起始值 值 。 已知的EM EM ( SEM) 算法比得更精确的周期性分布, 也用真实的缩算法分析 。