Informative cluster size (ICS) arises in situations with clustered data where a latent relationship exists between the number of participants in a cluster and the outcome measures. Although this phenomenon has been sporadically reported in statistical literature for nearly two decades now, further exploration is needed in certain statistical methodologies to avoid potentially misleading inferences. For inference about population quantities without covariates, inverse cluster size reweightings are often employed to adjust for ICS. Further, to study the effect of covariates on disease progression described by a multistate model, the pseudo-value regression technique has gained popularity in time-to-event data analysis. We seek to answer the question: "How to apply pseudo-value regression to clustered time-to-event data when cluster size is informative?" ICS adjustment by the reweighting method can be performed in two steps; estimation of marginal functions of the multistate model and fitting the estimating equations based on pseudo-value responses, leading to four possible strategies. We present theoretical arguments and thorough simulation experiments to ascertain the correct strategy for adjusting for ICS. A further extension of our methodology is implemented to include informativeness induced by the intra-cluster group size. We demonstrate the methods in two real-world applications: (i) to determine predictors of tooth survival in a periodontal study, and (ii) to identify indicators of ambulatory recovery in spinal cord injury patients who participated in locomotor-training rehabilitation.
翻译:----
信息聚类大小(ICS)发生在聚类数据中存在潜在关系的情况下,而参与者数量和结果测量之间存在联系。虽然这种现象在统计文献中已经零散报告了近20年,但在某些统计方法中仍需进一步探索以避免潜在误导性的推断。对于没有协变量的总体量推断,通常采用逆聚类大小加权法来调整ICS。此外,在描述多状态模型的疾病进展情况方面,虚值回归技术在时间至事件数据分析中变得越来越受欢迎。我们的问题是:“在信息聚类大小具有信息时,如何将虚值回归应用于聚类时间至事件数据?”通过重新加权方法进行ICS调整可以分为两个步骤:估计多状态模型的边际函数和基于虚值响应拟合估计方程,从而导致四种可能的策略。我们提出了理论论证和详细的模拟实验,以确定适用于ICS调整的正确策略。我们还通过包含的聚类组大小的信息性差异进行了进一步扩展。我们在两个真实应用中演示了这些方法:(一)确定牙周疾病研究中的牙齿存活的预测因子,(二)鉴别脊髓损伤患者康复的迹象,这些患者参加了步态训练康复。