Nowadays, the product search service of e-commerce platforms has become a vital shopping channel in people's life. The retrieval phase of products determines the search system's quality and gradually attracts researchers' attention. Retrieving the most relevant products from a large-scale corpus while preserving personalized user characteristics remains an open question. Recent approaches in this domain have mainly focused on embedding-based retrieval (EBR) systems. However, after a long period of practice on Taobao, we find that the performance of the EBR system is dramatically degraded due to its: (1) low relevance with a given query and (2) discrepancy between the training and inference phases. Therefore, we propose a novel and practical embedding-based product retrieval model, named Multi-Grained Deep Semantic Product Retrieval (MGDSPR). Specifically, we first identify the inconsistency between the training and inference stages, and then use the softmax cross-entropy loss as the training objective, which achieves better performance and faster convergence. Two efficient methods are further proposed to improve retrieval relevance, including smoothing noisy training data and generating relevance-improving hard negative samples without requiring extra knowledge and training procedures. We evaluate MGDSPR on Taobao Product Search with significant metrics gains observed in offline experiments and online A/B tests. MGDSPR has been successfully deployed to the existing multi-channel retrieval system in Taobao Search. We also introduce the online deployment scheme and share practical lessons of our retrieval system to contribute to the community.
翻译:目前,电子商务平台的产品搜索服务已成为人们生活中一个至关重要的购物渠道。产品的检索阶段决定了搜索系统的质量,并逐渐吸引研究人员的注意。从大型产品库中检索最相关的产品,同时保留个性化用户特性仍然是一个未决问题。这一领域最近的做法主要侧重于嵌入检索系统(EBR),然而,经过对道保的长期实践,我们发现EBR系统的性能由于以下原因急剧退化:(1) 与特定查询的相关性低,(2) 培训和推断阶段之间的差异。因此,我们建议采用一个新的和实用的嵌入产品检索模式,名为多层深层产品检索(MGDSPR)。具体地说,我们首先确定培训和推断阶段之间的不一致,然后将软体跨作物损失作为培训目标,从而实现更好的业绩和更快的趋同。我们进一步提出了两种有效的方法来改进检索相关性,包括调整调培训数据,并改进基于嵌入的硬层产品回收模式,即称为多层深层深度深度深层产品检索模型(MARPR) 。我们还成功地评估了在线搜索系统对AMAG系统进行在线搜索和升级。我们所观察到的系统进行在线搜索的系统。我们已进行了在线搜索和测试。