Ranking problems, also known as preference learning problems, define a widely spread class of statistical learning problems with many applications, including fraud detection, document ranking, medicine, credit risk screening, image ranking or media memorability. In this article, we systematically review different types of instance ranking problems, i.e., ranking problems that require the prediction of an order of the response variables, and the corresponding loss functions resp. goodness criteria. We discuss the difficulties when trying to optimize those criteria. As for a detailed and comprehensive overview of existing machine learning techniques to solve such ranking problems, we systemize existing techniques and recapitulate the corresponding optimization problems in a unified notation. We also discuss to which of the ranking problems the respective algorithms are tailored and identify their strengths and limitations. Computational aspects and open research problems are also considered.
翻译:排名问题,也称为偏好学习问题,界定了广泛分布的统计学习问题类别,涉及许多应用,包括欺诈检测、文件排名、医学、信用风险筛选、图像排名或媒体租赁;在本条中,我们系统地审查不同类型的案例排名问题,即需要预测响应变量顺序的排名问题和相应的损失函数。我们讨论了在试图优化这些标准时遇到的困难。关于对现有机器学习技术进行详细和综合的概述,以解决这类排名问题,我们将现有技术系统化,并在统一标记中重新概括相应的优化问题。我们还讨论了各自的算法针对哪些排序问题,并确定了其优点和局限性。还考虑了计算方面和公开研究问题。