CROLoss: 争取使建议系统检索模型的可自定义损失 (CROLoss: Towards a Customizable Loss for Retrieval Models in Recommender Systems)

In large-scale recommender systems, retrieving top N relevant candidates accurately with resource constrain is crucial. To evaluate the performance of such retrieval models, Recall@N, the frequency of positive samples being retrieved in the top N ranking, is widely used. However, most of the conventional loss functions for retrieval models such as softmax cross-entropy and pairwise comparison methods do not directly optimize Recall@N. Moreover, those conventional loss functions cannot be customized for the specific retrieval size N required by each application and thus may lead to sub-optimal performance. In this paper, we proposed the Customizable Recall@N Optimization Loss (CROLoss), a loss function that can directly optimize the Recall@N metrics and is customizable for different choices of N. This proposed CROLoss formulation defines a more generalized loss function space, covering most of the conventional loss functions as special cases. Furthermore, we develop the Lambda method, a gradient-based method that invites more flexibility and can further boost the system performance. We evaluate the proposed CROLoss on two public benchmark datasets. The results show that CROLoss achieves SOTA results over conventional loss functions for both datasets with various choices of retrieval size N. CROLoss has been deployed onto our online E-commerce advertising platform, where a fourteen-day online A/B test demonstrated that CROLoss contributes to a significant business revenue growth of 4.75%.

翻译：在大型推荐人系统中,以资源限制来准确检索最高级N相关候选人至关重要。为了评价这类检索模型的性能,我们广泛使用在最N级中检索的积极样本的频率Recall@N。然而,大多数常规的检索模型损失功能,如软麦克斯交叉成激素和对称比较方法,并不直接优化回调@N。此外,这些常规损失功能不能按每个应用程序要求的具体检索大小量定制,从而可能导致亚最佳性能。在本文中,我们提议了可自定义的回调@N最佳化损失(CROLos),这是一个可以直接优化回调@N计量的频率,并为N的不同选择定制。提议的CROLos公式定义了一个更普遍的损失功能空间,将大多数常规损失功能作为特例加以覆盖。此外,我们开发了兰巴达方法,一种基于梯度的方法,可以要求更大的灵活性,从而可以进一步提升系统业绩。我们评估了两个公共基准数据集的拟议CROLos(CROLos),这是一个损失函数,可以直接优化Recall@NDOLs reportalal lavestiumal ad adal ad lavestical ad lavestistral laves laft laves laves laps laves laveal laves laves laves laps lautes

相关内容

损失函数（机器学习）

关注 10

损失函数，在AI中亦称呼距离函数，度量函数。此处的距离代表的是抽象性的，代表真实数据与预测数据之间的误差。损失函数（loss function）是用来估量你模型的预测值f(x)与真实值Y的不一致程度，它是一个非负实值函数,通常使用L(Y, f(x))来表示，损失函数越小，模型的鲁棒性就越好。损失函数是经验风险函数的核心部分，也是结构风险函数重要组成部分。

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日