Embedding-based retrieval (EBR) methods are widely used in modern recommender systems thanks to its simplicity and effectiveness. However, along the journey of deploying and iterating on EBR in production, we still identify some fundamental issues in existing methods. First, when dealing with large corpus of candidate items, EBR models often have difficulties in balancing the performance on distinguishing highly relevant items (positives) from both irrelevant ones (easy negatives) and from somewhat related yet not competitive ones (hard negatives). Also, we have little control in the diversity and fairness of the retrieval results because of the ``greedy'' nature of nearest vector search. These issues compromise the performance of EBR methods in large-scale industrial scenarios. This paper introduces a simple and proven-in-production solution to overcome these issues. The proposed solution takes a divide-and-conquer approach: the whole set of candidate items are divided into multiple clusters and we run EBR to retrieve relevant candidates from each cluster in parallel; top candidates from each cluster are then combined by some controllable merging strategies. This approach allows our EBR models to only concentrate on discriminating positives from mostly hard negatives. It also enables further improvement from a multi-tasking learning (MTL) perspective: retrieval problems within each cluster can be regarded as individual tasks; inspired by recent successes in prompting and prefix-tuning, we propose an efficient task adaption technique further boosting the retrieval performance within each cluster with negligible overheads.
翻译:在现代建议系统中广泛使用基于嵌入的检索(EBR)方法,因为其简单和有效,因此在现代建议系统中广泛使用基于嵌入的检索(EBR)方法。然而,在EBR生产过程中的部署和循环过程中,我们仍然发现现有方法中的一些根本问题。首先,在处理大量候选项目时,EBR模型往往难以平衡区分高度相关项目(积极)的性能(积极性)与不相关项目(消极性)和相对相关但并不具有竞争力的项目(硬性负性)的性能。此外,由于“greedy”是距离最近的矢量搜索的性质,因此我们对检索结果的多样性和公正性几乎没有控制。由于这些问题,我们EBR模型只能在大规模工业情景中损害EBR方法的性能。本文提出了克服这些问题的简单和经过验证的在生产过程中采用的解决办法。 所提议的解决办法采取分而有区别的办法:整个候选项目组分为多个组群,我们运行EBRRR,以便从每个组同时检索相关候选人;然后由某些可控制的合并战略将每个组的高级候选人合并。这种方法使得我们的EBRBS模型只能专注于在从多数的微的回收中选择中,从每个组合中学习。