Identifying prognostic factors for disease progression is a cornerstone of medical research. Repeated assessments of a marker outcome are often used to evaluate disease progression, and the primary research question is to identify factors associated with the longitudinal trajectory of this marker. Our work is motivated by diabetic kidney disease (DKD), where serial measures of estimated glomerular filtration rate (eGFR) are the longitudinal measure of kidney function, and there is notable interest in identifying factors, such as metabolites, that are prognostic for DKD progression. Linear mixed models (LMM) with serial marker outcomes (e.g., eGFR) are a standard approach for prognostic model development, namely by evaluating the time and prognostic factor (e.g., metabolite) interaction. However, two-stage methods that first estimate individual-specific eGFR slopes, and then use these as outcomes in a regression framework with metabolites as predictors are easy to interpret and implement for applied researchers. Herein, we compared the LMM and two-stage methods, in terms of bias and mean squared error via analytic methods and simulations, allowing for irregularly spaced measures and missingness. Our findings provide novel insights into when two-stage methods are suitable longitudinal prognostic modeling alternatives to the LMM. Notably, our findings generalize to other disease studies.
翻译:反复评估标记结果的线性混合模型(LMM)与序列标记结果(例如,eGFR)经常被用来评估与该标记的纵向轨迹相关的因素,而首要研究问题是确定与该标记的纵向轨迹相关的因素。我们的工作是由糖尿病肾病(DKD)驱动的,在那里,估计球状过滤率(eGFR)的序列测量是肾功能的纵向测量,人们非常希望确定对DKD进展具有预测性的因素,例如代谢物等对DKD进展具有预测性的因素。具有序列标记结果(例如,eGFR)的线性混合模型(LMMM)经常用来评估与该标记的纵向轨迹径径径径径径相关的因素。我们的工作是评估时间和预测性因素(例如,代谢物)之间的相互作用。然而,首先评估个人特定的球状体过滤率斜坡度(eGFR)的垂直度的测算,然后将这些结果作为回归框架的结果,作为预测剂易于解释和实施应用研究人员。在这里,我们将LMMMM和两阶段方法进行比较,从偏见和两阶段方法,通过模拟将我们的标准误判误判方法提供。