In large-scale recommender systems, retrieving top N relevant candidates accurately with resource constrain is crucial. To evaluate the performance of such retrieval models, Recall@N, the frequency of positive samples being retrieved in the top N ranking, is widely used. However, most of the conventional loss functions for retrieval models such as softmax cross-entropy and pairwise comparison methods do not directly optimize Recall@N. Moreover, those conventional loss functions cannot be customized for the specific retrieval size N required by each application and thus may lead to sub-optimal performance. In this paper, we proposed the Customizable Recall@N Optimization Loss (CROLoss), a loss function that can directly optimize the Recall@N metrics and is customizable for different choices of N. This proposed CROLoss formulation defines a more generalized loss function space, covering most of the conventional loss functions as special cases. Furthermore, we develop the Lambda method, a gradient-based method that invites more flexibility and can further boost the system performance. We evaluate the proposed CROLoss on two public benchmark datasets. The results show that CROLoss achieves SOTA results over conventional loss functions for both datasets with various choices of retrieval size N. CROLoss has been deployed onto our online E-commerce advertising platform, where a fourteen-day online A/B test demonstrated that CROLoss contributes to a significant business revenue growth of 4.75%.
翻译:在大型推荐人系统中,以资源限制来准确检索最高级N相关候选人至关重要。为了评价这类检索模型的性能,我们广泛使用在最N级中检索的积极样本的频率Recall@N。然而,大多数常规的检索模型损失功能,如软麦克斯交叉成激素和对称比较方法,并不直接优化回调@N。此外,这些常规损失功能不能按每个应用程序要求的具体检索大小量定制,从而可能导致亚最佳性能。在本文中,我们提议了可自定义的回调@N最佳化损失(CROLos),这是一个可以直接优化回调@N计量的频率,并为N的不同选择定制。提议的CROLos公式定义了一个更普遍的损失功能空间,将大多数常规损失功能作为特例加以覆盖。此外,我们开发了兰巴达方法,一种基于梯度的方法,可以要求更大的灵活性,从而可以进一步提升系统业绩。我们评估了两个公共基准数据集的拟议CROLos(CROLos),这是一个损失函数,可以直接优化Recall@NDOLs reportalal lavestiumal ad adal ad lavestical ad lavestistral laves laft laves laves laps laves laveal laves laves laves laps lautes