Alternative recommender systems are critical for ecommerce companies. They guide customers to explore a massive product catalog and assist customers to find the right products among an overwhelming number of options. However, it is a non-trivial task to recommend alternative products that fit customer needs. In this paper, we use both textual product information (e.g. product titles and descriptions) and customer behavior data to recommend alternative products. Our results show that the coverage of alternative products is significantly improved in offline evaluations as well as recall and precision. The final A/B test shows that our algorithm increases the conversion rate by 12 percent in a statistically significant way. In order to better capture the semantic meaning of product information, we build a Siamese Network with Bidirectional LSTM to learn product embeddings. In order to learn a similarity space that better matches the preference of real customers, we use co-compared data from historical customer behavior as labels to train the network. In addition, we use NMSLIB to accelerate the computationally expensive kNN computation for millions of products so that the alternative recommendation is able to scale across the entire catalog of a major ecommerce site.
翻译:替代建议系统对电子商务公司至关重要,它们指导客户探索大型产品目录,并协助客户在绝大多数选项中找到正确的产品。然而,推荐适合客户需要的替代产品是一项非三重任务。在本文中,我们使用文本产品信息(如产品标题和描述)和客户行为数据来推荐替代产品。我们的结果表明,在离线评价中替代产品的覆盖面以及回溯和精确度都大为改进。最后的A/B测试表明,我们的算法以具有统计意义的方式将转换率提高了12%。为了更好地掌握产品信息的语义意义,我们建立了一个带有双向LSTM的暹姆网络来学习产品嵌入。为了学习更符合实际客户偏好程度的类似空间,我们用历史客户行为的数据作为标签来培训网络。此外,我们使用NMSLIB来加快数百万产品计算成本昂贵的KNN的计算速度,以便替代建议能够跨越一个主要电子商务网站的整个目录。