Title translation: 鲁棒共识在排名数据分析中的应用：定义、特性和计算问题 Abstract translation: 随着鲁棒性在人工智能系统中变得至关重要，必须开发出即使在部分污染数据存在的情况下仍然可靠的统计学习技术。在最简单的情况下，偏好数据以（完整的）排名形式出现，这种数据的适当概念和工具的需求更加紧迫，因为这种数据产生或消费的技术（如搜索引擎、推荐系统）现在正在广泛部署。但是，由于排序数据集合（即对称群$\mathfrak{S}_n$）没有向量空间结构，并且排名数据分析中所考虑的统计量的复杂性，因此在此领域中制定稳健性目标具有挑战性。在本文中，我们引入了鲁棒性的概念，以及针对共识排序的特定统计方法，共识排序是排序数据分析中的旗舰问题，旨在通过中位数分数来总结对$\mathfrak{S}_n$上的概率分布。精确地说，我们提出了定制的破坏点概念，专门针对共识排序，并解决了相关的计算问题。除了理论贡献外，所提出的方法的适用性得到了实验证明。 (Robust Consensus in Ranking Data Analysis: Definitions, Properties and Computational Issues)

翻译：Title translation: 鲁棒共识在排名数据分析中的应用：定义、特性和计算问题 Abstract translation: 随着鲁棒性在人工智能系统中变得至关重要，必须开发出即使在部分污染数据存在的情况下仍然可靠的统计学习技术。在最简单的情况下，偏好数据以（完整的）排名形式出现，这种数据的适当概念和工具的需求更加紧迫，因为这种数据产生或消费的技术（如搜索引擎、推荐系统）现在正在广泛部署。但是，由于排序数据集合（即对称群$\mathfrak{S}_n$）没有向量空间结构，并且排名数据分析中所考虑的统计量的复杂性，因此在此领域中制定稳健性目标具有挑战性。在本文中，我们引入了鲁棒性的概念，以及针对共识排序的特定统计方法，共识排序是排序数据分析中的旗舰问题，旨在通过中位数分数来总结对$\mathfrak{S}_n$上的概率分布。精确地说，我们提出了定制的破坏点概念，专门针对共识排序，并解决了相关的计算问题。除了理论贡献外，所提出的方法的适用性得到了实验证明。

Morgane Goibert,Clément Calauzènes,Ekhine Irurozki,Stéphan Clémençon

As the issue of robustness in AI systems becomes vital, statistical learning techniques that are reliable even in presence of partly contaminated data have to be developed. Preference data, in the form of (complete) rankings in the simplest situations, are no exception and the demand for appropriate concepts and tools is all the more pressing given that technologies fed by or producing this type of data (e.g. search engines, recommending systems) are now massively deployed. However, the lack of vector space structure for the set of rankings (i.e. the symmetric group $\mathfrak{S}_n$) and the complex nature of statistics considered in ranking data analysis make the formulation of robustness objectives in this domain challenging. In this paper, we introduce notions of robustness, together with dedicated statistical methods, for Consensus Ranking the flagship problem in ranking data analysis, aiming at summarizing a probability distribution on $\mathfrak{S}_n$ by a median ranking. Precisely, we propose specific extensions of the popular concept of breakdown point, tailored to consensus ranking, and address the related computational issues. Beyond the theoretical contributions, the relevance of the approach proposed is supported by an experimental study.

翻译：