Recent works have revealed an essential paradigm in designing loss functions that differentiate individual losses vs. aggregate losses. The individual loss measures the quality of the model on a sample, while the aggregate loss combines individual losses/scores over each training sample. Both have a common procedure that aggregates a set of individual values to a single numerical value. The ranking order reflects the most fundamental relation among individual values in designing losses. In addition, decomposability, in which a loss can be decomposed into an ensemble of individual terms, becomes a significant property of organizing losses/scores. This survey provides a systematic and comprehensive review of rank-based decomposable losses in machine learning. Specifically, we provide a new taxonomy of loss functions that follows the perspectives of aggregate loss and individual loss. We identify the aggregator to form such losses, which are examples of set functions. We organize the rank-based decomposable losses into eight categories. Following these categories, we review the literature on rank-based aggregate losses and rank-based individual losses. We describe general formulas for these losses and connect them with existing research topics. We also suggest future research directions spanning unexplored, remaining, and emerging issues in rank-based decomposable losses.
翻译:最近的工作揭示了设计区分个人损失相对于总损失的损失的损失功能的基本范式; 个人损失衡量了抽样模型的质量,而总损失则将每个培训样本中的个别损失/分数结合起来; 两者都有一套共同的程序,将一套个人价值汇总为一个单一的数值; 排序顺序反映了设计损失中个人价值之间的最根本关系; 此外, 一种损失可以分解成一个单个术语组合的脱位性成为组织损失/分数的重要属性; 个人损失是组织损失/分数的重大属性; 本次调查对机器学习中基于等级的可分解损失进行系统和全面的审查。 具体地说,我们提供了一种遵循累计损失和个别损失视角的新的损失功能分类方法。 我们还确定了形成这种损失的分类方法,这是设定功能的例子。 我们将基于等级的可分解损失分为八个类别。 我们根据这些分类,我们审查了关于基于等级的总损失和基于等级的个人损失的文献。 我们描述了这些损失的一般公式,并将这些损失与现有的研究专题联系起来。 我们还建议了未来研究方向, 划分了未计量的损失。