Informative cluster size (ICS) arises in situations with clustered data where a latent relationship exists between the number of participants in a cluster and the outcome measures. Although this phenomenon has been sporadically reported in statistical literature for nearly two decades now, further exploration is needed in certain statistical methodologies to avoid potentially misleading inferences. For inference about population quantities without covariates, inverse cluster size reweightings are often employed to adjust for ICS. Further, to study the effect of covariates on disease progression described by a multistate model, the pseudo-value regression technique has gained popularity in time-to-event data analysis. We seek to answer the question: "How to apply pseudo-value regression to clustered time-to-event data when cluster size is informative?" ICS adjustment by the reweighting method can be performed in two steps; estimation of marginal functions of the multistate model and fitting the estimating equations based on pseudo-value responses, leading to four possible strategies. We present theoretical arguments and thorough simulation experiments to ascertain the correct strategy for adjusting for ICS. A further extension of our methodology is implemented to include informativeness induced by the intra-cluster group size. We demonstrate the methods in two real-world applications: (i) to determine predictors of tooth survival in a periodontal study, and (ii) to identify indicators of ambulatory recovery in spinal cord injury patients who participated in locomotor-training rehabilitation.
翻译:在一个集群参与人数和结果计量之间的潜在关系存在集群数据的情况下,产生了知情群集规模(ICS) 。虽然这一现象在统计文献中已经零星报告了近20年,但需要对某些统计方法进行进一步探讨,以避免潜在的误导推论。为了推断人口数量而不出现共差,往往采用反集群规模的重新加权来调整ICS。此外,为了研究多种国家模型描述的共变对疾病增加的影响,假价值回归技术在时间到时间到活动的数据分析中越来越受欢迎。我们寻求回答一个问题:“在集群规模信息丰富时,如何对集群时间到活动的数据采用假价值回归? ICS根据重算法进行调整,可以分两步进行;估计多国家模型的边际功能,根据假价值对策调整估计方程式,导致四种可能的战略。我们提出理论论和彻底的模拟实验,以确定ICS调整的正确战略。我们进一步扩展了我们的方法,将信息回溯性回归到集群规模上,在OFI研究中,确定公司参与的恢复期。