Embedding based product recommendations have gained popularity in recent years due to its ability to easily integrate to large-scale systems and allowing nearest neighbor searches in real-time. The bulk of studies in this area has predominantly been focused on similar item recommendations. Research on complementary item recommendations, on the other hand, still remains considerably under-explored. We define similar items as items that are interchangeable in terms of their utility and complementary items as items that serve different purposes, yet are compatible when used with one another. In this paper, we apply a novel approach to finding complementary items by leveraging dual embedding representations for products. We demonstrate that the notion of relatedness discovered in NLP for skip-gram negative sampling (SGNS) models translates effectively to the concept of complementarity when training item representations using co-purchase data. Since sparsity of purchase data is a major challenge in real-world scenarios, we further augment the model using synthetic samples to extend coverage. This allows the model to provide complementary recommendations for items that do not share co-purchase data by leveraging other abundantly available data modalities such as images, text, clicks etc. We establish the effectiveness of our approach in improving both coverage and quality of recommendations on real world data for a major online retail company. We further show the importance of task specific hyperparameter tuning in training SGNS. Our model is effective yet simple to implement, making it a great candidate for generating complementary item recommendations at any e-commerce website.
翻译:近年来,基于嵌入式产品的建议由于能够很容易地融入大型系统,并允许近邻实时搜索,近年来受到欢迎。这一领域的大部分研究主要侧重于类似的项目建议。另一方面,对补充项目建议的研究仍然相当不足。我们将类似项目界定为可互换的、其效用和补充项目的项目,作为用途不同的物品,但在使用时是兼容的。在本文件中,我们采用新颖办法寻找补充项目,利用产品双重嵌入式表示方式。我们证明,在NLP中发现的用于跳格负抽样模型的关联性概念,在使用共同购买数据进行项目说明培训时,可有效地转化为互补概念。由于采购数据的紧张性是现实世界情景中的一项重大挑战,我们进一步增加使用合成样品来扩大覆盖面的模型。这样,通过利用图像、文本、点击等其他大量现有数据模式,为不分享共同购买数据的项目提供补充建议。我们为改进全球范围培训中的项目设定了一个有效的电子项目,在改进全球范围方面,我们在改进全球范围,同时为全球范围提供一项简单的具体培训建议,我们为全球范围提供一项在线数据,从而进一步提升全球范围,我们为全球范围提供一项在线提供具体建议。