The need to measure the degree of agreement among R raters who independently classify n subjects within K nominal categories is frequent in many scientific areas. The most popular measures are Cohen's kappa (R = 2), Fleiss' kappa, Conger's kappa and Hubert's kappa (R $\geq$ 2) coefficients, which have several defects. In 2004, the delta coefficient was defined for the case of R = 2, which did not have the defects of Cohen's kappa coefficient. This article extends the coefficient delta from R = 2 raters to R $\geq$ 2. The coefficient multi-rater delta has the same advantages as the coefficient delta with regard to the type kappa coefficients: i) it is intuitive and easy to interpret, because it refers to the proportion of replies that are concordant and non random; ii) the summands which give its value allow the degree of agreement in each category to be measured accurately, with no need to be collapsed; and iii) it is not affected by the marginal imbalance.
翻译:在许多科学领域,独立区分K名义类别内n主题的R评级人之间需要达到一致的程度是常见的。最受欢迎的措施是科恩的kappa(R=2)、Fleiss的kappa、Conger的kappa和Hubert的kappa(R $\geq$2)系数,这些系数有几个缺陷。2004年,为R =2的情况确定了三角洲系数,该系数没有科恩的kappa系数的缺陷。本条将系数三角洲从R=2扩大到R $\geq$2. 系数多拉特三角洲与系数三角洲具有相同的优势,在Kappa系数类型方面:(一) 该系数是直观的,易于解释,因为它提到答复的比例是一致的和非随机的;二)给出其价值的总和允许准确测量每一类别的协议程度,而无需崩溃;三)它不受边缘不平衡的影响。