Learning how to aggregate ranking lists has been an active research area for many years and its advances have played a vital role in many applications ranging from bioinformatics to internet commerce. The problem of discerning reliability of rankers based only on the rank data is of great interest to many practitioners, but has received less attention from researchers. By dividing the ranked entities into two disjoint groups, i.e., relevant and irrelevant/background ones, and incorporating the Mallows model for the relative ranking of relevant entities, we propose a framework for rank aggregation that can not only distinguish quality differences among the rankers but also provide the detailed ranking information for relevant entities. Theoretical properties of the proposed approach are established, and its advantages over existing approaches are demonstrated via simulation studies and real-data applications. Extensions of the proposed method to handle partial ranking lists and conduct covariate-assisted rank aggregation are also discussed.
翻译:多年来,学习如何汇总排名清单是一个积极的研究领域,其进展在许多应用领域,从生物信息学到互联网商务,都发挥了至关重要的作用。许多从业者对仅根据排名数据识别排名员可靠性的问题非常感兴趣,但从研究人员得到的注意较少。通过将排名实体分为两个互不相连的小组,即相关和不相关的小组/背景小组,并纳入关于相关实体相对排名的Mallows模式,我们提出了一个排名汇总框架,不仅可以区分排名员之间的质量差异,而且还可以为相关实体提供详细的排名信息。还确定了拟议方法的理论属性,并通过模拟研究和真实数据应用展示了其相对于现有方法的优势。还讨论了扩大拟议的处理部分排名清单和进行共变分类汇总的方法。