This paper reports on a follow-up study of the work reported in Sakai, which explored suitable evaluation measures for ordinal quantification tasks. More specifically, the present study defines and evaluates, in addition to the quantification measures considered earlier, a few variants of an ordinal quantification measure called Root Normalised Order-aware Divergence (RNOD), as well as a measure which we call Divergence based on Kendall's $\tau$ (DNKT). The RNOD variants represent alternative design choices based on the idea of Sakai's Distance-Weighted sum of squares (DW), while DNKT is designed to ensure that the system's estimated distribution over classes is faithful to the target priorities over classes. As this Priority Preserving Property (PPP) of DNKT may be useful in some applications, we also consider combining some of the existing quantification measures with DNKT. Our experiments with eight ordinal quantification data sets suggest that the variants of RNOD do not offer any benefit over the original RNOD at least in terms of system ranking consistency, i.e., robustness of the system ranking to the choice of test data. Of all ordinal quantification measures considered in this study (including Normalised Match Distance, a.k.a. Earth Mover's Distance), RNOD is the most robust measure overall. Hence the design choice of RNOD is a good one from this viewpoint. Also, DNKT is the worst performer in terms of system ranking consistency. Hence, if DNKT seems appropriate for a task, sample size design should take its statistical instability into account.
翻译:本文报告了对Sakai公司所报告工作的后续研究,该研究探讨了对正方形的距离-重量之和(DW)的适当评价措施。更具体地说,本研究除了界定和评价先前考虑的量化措施外,还界定和评价了被称为“根正态定序觉知异(RNOD)”,以及我们根据Kendall $\tau(DNKT)称为“差异”的一项措施。RNOD变量代表了基于Sakai的距离-重量之和正方形(DW)概念的替代设计选择,而DNKT则旨在确保系统对各类分配的估计符合各类的目标优先事项。由于DNKT的优先保留属性(PPP)在某些应用中可能有用,我们还考虑将一些现有的量化措施与DNKT(DNKT)相结合。我们用8个或分级量化数据集进行的实验也表明,RNOD的变数值对于最初的RNOD(RNOD)而言,至少在系统排序一致性方面没有任何好处,即,MCT的准确性定义值,在这种系统上,在进行这一深度设计中,其最精确程度的排序中应该采用一个正常的排序。