The use of pretrained embeddings has become widespread in modern e-commerce machine learning (ML) systems. In practice, however, we have encountered several key issues when using pretrained embedding in a real-world production system, many of which cannot be fully explained by current knowledge. Unfortunately, we find that there is a lack of a thorough understanding of how pre-trained embeddings work, especially their intrinsic properties and interactions with downstream tasks. Consequently, it becomes challenging to make interactive and scalable decisions regarding the use of pre-trained embeddings in practice. Our investigation leads to two significant discoveries about using pretrained embeddings in e-commerce applications. Firstly, we find that the design of the pretraining and downstream models, particularly how they encode and decode information via embedding vectors, can have a profound impact. Secondly, we establish a principled perspective of pre-trained embeddings via the lens of kernel analysis, which can be used to evaluate their predictability, interactively and scalably. These findings help to address the practical challenges we faced and offer valuable guidance for successful adoption of pretrained embeddings in real-world production. Our conclusions are backed by solid theoretical reasoning, benchmark experiments, as well as online testings.
翻译:使用预训练嵌入在现代电子商务机器学习系统中已经变得普遍。然而,在实践中,我们遇到了一些重要问题,使用预训练嵌入在实际生产系统中时,很多问题不能得到完全解释。不幸的是,我们发现当前对预训练嵌入的工作原理缺乏透彻的理解,特别是它们的内在特性和与下游任务的相互作用。因此,电子商务机器学习中应用预训练嵌入的决策变得难以交互和可扩展。我们的研究发现使用预训练嵌入在电子商务应用中有两个重大发现。首先,我们发现预训练模型以及下游模型的设计,特别是它们如何通过嵌入向量编码和解码信息,可以产生深远的影响。其次,我们通过核分析的方法确立了预训练嵌入的原则性视角,该视角可以用于交互式和可扩展的预测性评估。这些发现有助于解决我们遇到的实践挑战,为预训练嵌入在实际生产中的成功应用提供宝贵指导。我们的结论基于坚实的理论推理、基准实验和在线测试。