Continuous response variables often need to be transformed to meet regression modeling assumptions; however, finding the optimal transformation is challenging and results may vary with the choice of transformation. When a continuous response variable is measured repeatedly for a subject or the continuous responses arise from clusters, it is more challenging to model the continuous response data due to correlation within clusters. We extend a widely used ordinal regression model, the cumulative probability model (CPM), to fit clustered continuous response variables based on generalized estimating equation (GEE) methods for ordinal responses. With our approach, estimates of marginal parameters, cumulative distribution functions (CDFs), expectations, and quantiles conditional on covariates can be obtained without pre-transformation of the potentially skewed continuous response data. Computational challenges arise with large numbers of distinct values of the continuous response variable, and we propose two feasible and computationally efficient approaches to fit CPMs for clustered continuous response variables with different working correlation structures. We study finite sample operating characteristics of the estimators via simulation, and illustrate their implementation with two data examples. One studies predictors of CD4:CD8 ratios in an HIV study. The other uses data from The Lung Health Study to investigate the contribution of a single nucleotide polymorphism to lung function decline.
翻译:连续应对变量往往需要转换,以适应回归模型假设;然而,发现最佳转型是挑战性的,结果可能随转换的选择而变化。当对一个主题反复测量连续应对变量时,或者由于组群内部的关联性而产生连续应对响应数据时,由于集群内部的关联性而反复计量连续应对数据则更具挑战性。我们推广了一种广泛使用的统称回归模型,即累积概率模型(CPM),以基于通用估计方程式(GEEE)的组合式连续应对变量为基础,以组合组合组合组合应对响应变量。我们利用我们的方法,可以取得边际参数、累积分布函数(CDFS)、预期值和以共变数为条件的量化变量的估计数,而不必预先变换潜在扭曲的连续应对数据。计算挑战随着连续应对变量的众多不同值而出现。我们提出了两种可行的计算高效方法,以匹配基于不同工作关联性结构的组合组合连续应对变量。我们通过模拟研究估算者有限的抽样操作特点,并以两个数据示例来说明其执行情况。一项研究是CD4:H4:CD8的CD8比率比率对艾滋病毒循环研究的其他数据。