Detection limits (DLs), where a variable is unable to be measured outside of a certain range, are common in research. Most approaches to handle DLs in the response variable implicitly make parametric assumptions on the distribution of data outside DLs. We propose a new approach to deal with DLs based on a widely used ordinal regression model, the cumulative probability model (CPM). The CPM is a type of semiparametric linear transformation model. CPMs are rank-based and can handle mixed distributions of continuous and discrete outcome variables. These features are key for analyzing data with DLs because while observations inside DLs are typically continuous, those outside DLs are censored and generally put into discrete categories. With a single lower DL, the CPM assigns values below the DL as having the lowest rank. When there are multiple DLs, the CPM likelihood can be modified to appropriately distribute probability mass. We demonstrate the use of CPMs with simulations and two HIV data examples. The first example models a biomarker in which 15% of observations are below a DL. The second uses multi-cohort data to model viral load, where approximately 55% of observations are outside DLs which vary across sites and over time.
翻译:在某一范围之外无法测量变量的检测限度(DLs),在某一范围之外无法测量变量的检测限度(DLs)是研究中常见的。在反应变量中处理DLs的大多数处理DLs的方法都隐含地假设DLs之外数据分布的参数性假设。我们建议了一种基于广泛使用的圆形回归模型,即累积概率模型(CPM)处理DL的新方法。CPM是一种半对数线性线性变异模型。CPM是一类半参数性线性变异模型。CPM是按等级排列的,可以处理连续和离散结果变量的混合分布。这些特征是分析DLs数据的关键,因为DLs内部的观测通常是连续的,DLs以外的观测通常会受到检查,DLs以外的观测一般会被置于离散类别中。如果使用一个较低的 DL,CPM 将D 下值定为最低等级。如果有多个DL,CPM 的可能性可以修改以适当分配概率质量。我们用模拟和两个艾滋病毒数据实例来证明CPMs的使用情况。第一个生物标记器,其中15%的观测点在DL以下。第二,则使用多-cohootL。