Clustered data are common in biomedical research. Observations in the same cluster are often more similar to each other than to observations from other clusters. The intraclass correlation coefficient (ICC), first introduced by R. A. Fisher, is frequently used to measure this degree of similarity. However, the ICC is sensitive to extreme values and skewed distributions, and depends on the scale of the data. It is also not applicable to ordered categorical data. We define the rank ICC as a natural extension of Fisher's ICC to the rank scale, and describe its corresponding population parameter. The rank ICC is simply interpreted as the rank correlation between a random pair of observations from the same cluster. We also extend the definition when the underlying distribution has more than two hierarchies. We describe estimation and inference procedures, show the asymptotic properties of our estimator, conduct simulations to evaluate its performance, and illustrate our method in three real data examples with skewed data, count data, and three-level data.
翻译:分组数据在生物医学研究中很常见。同一组中的观测与其他组群的观测相比,往往更为相似。首先由R. A. Fisher引入的类内相关系数(ICC),经常用来测量这种相似程度。然而,国际商会对极端值和偏斜分布十分敏感,并取决于数据的规模。它也不适用于有命令的绝对数据。我们把国际商会的等级定义为Fisher's ICC的自然扩展至等级尺度,并描述其相应的人口参数。将国际商会的等级简单地解释为同一组群随机观测对等之间的等级相关性。当基本分布超过两个等级时,我们还将定义扩大。我们描述估计和推断程序,显示我们的估算器的无约束特性,进行模拟以评价其性能,并在三个真实数据示例中用斜度数据、计数数据和三级数据说明我们的方法。</s>