ListBERT: 学习将电子商务产品排在列表BERT名下 (ListBERT: Learning to Rank E-commerce products with Listwise BERT)

Efficient search is a critical component for an e-commerce platform with an innumerable number of products. Every day millions of users search for products pertaining to their needs. Thus, showing the relevant products on the top will enhance the user experience. In this work, we propose a novel approach of fusing a transformer-based model with various listwise loss functions for ranking e-commerce products, given a user query. We pre-train a RoBERTa model over a fashion e-commerce corpus and fine-tune it using different listwise loss functions. Our experiments indicate that the RoBERTa model fine-tuned with an NDCG based surrogate loss function(approxNDCG) achieves an NDCG improvement of 13.9% compared to other popular listwise loss functions like ListNET and ListMLE. The approxNDCG based RoBERTa model also achieves an NDCG improvement of 20.6% compared to the pairwise RankNet based RoBERTa model. We call our methodology of directly optimizing the RoBERTa model in an end-to-end manner with a listwise surrogate loss function as ListBERT. Since there is a low latency requirement in a real-time search setting, we show how these models can be easily adopted by using a knowledge distillation technique to learn a representation-focused student model that can be easily deployed and leads to ~10 times lower ranking latency.

翻译：高效搜索是电子商务平台的关键组成部分,该平台拥有无数的产品。每天有数百万用户根据自己的需要搜索产品。因此, 在顶端展示相关产品将提高用户经验。在这项工作中, 我们提出一种新的方法, 将基于变压器的模型与各种列表式损失功能相结合, 用于电子商务产品排序。我们预先用不同的列表损失功能, 将一个时装电子商务平台的 RoBERTA 模型培训成一个模型, 并使用不同的列表损失功能进行精细调整。我们的实验表明, RoBERTA 模型与基于 NDCG的代理损失功能( aproxNDCG) 相比, 将提高NDCG 的13.9%, 比起 ListNET 和 ListMLE 等受欢迎的列表损失功能。以 ExmlBERTA 模型为基础的ApproxNDCG RoBERTA 模型也实现了20.6%的改进。我们称, 以最易端到端优化的方式直接优化 RoBERTA 模型, 以列表式的替代损失功能功能功能, 以列表化为 ListBERTERTERTERTERTA 格式进行搜索定位, 的功能将如何在实际定位上学习,,,, 以一种低位学习的排序, 以方法, 学习以以的顺序显示的顺序展示方法,, 方法,, 学习,, 学习以学习方法在的以方法, 方法, 以方法, 以方法, 以方法, 以以学习的以的的的的以方法以方法以方法在学习方式进行学习的的的的的的的的的的的的的的的的以的的的方式, 以以的的的的的以以学习学习方式, 的的以以以以以以以平流学习学习方式进行学习更平的的

相关内容

损失函数（机器学习）

关注 10

损失函数，在AI中亦称呼距离函数，度量函数。此处的距离代表的是抽象性的，代表真实数据与预测数据之间的误差。损失函数（loss function）是用来估量你模型的预测值f(x)与真实值Y的不一致程度，它是一个非负实值函数,通常使用L(Y, f(x))来表示，损失函数越小，模型的鲁棒性就越好。损失函数是经验风险函数的核心部分，也是结构风险函数重要组成部分。

Linux导论，Introduction to Linux，96页ppt

专知会员服务

80+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日