The travel marketing platform of Alibaba serves an indispensable role for hundreds of different travel scenarios from Fliggy, Taobao, Alipay apps, etc. To provide personalized recommendation service for users visiting different scenarios, there are two critical issues to be carefully addressed. First, since the traffic characteristics of different scenarios, it is very challenging to train a unified model to serve all. Second, during the promotion period, the exposure of some specific items will be re-weighted due to manual intervention, resulting in biased logs, which will degrade the ranking model trained using these biased data. In this paper, we propose a novel Scenario-Aware Ranking Network (SAR-Net) to address these issues. SAR-Net harvests the abundant data from different scenarios by learning users' cross-scenario interests via two specific attention modules, which leverage the scenario features and item features to modulate the user behavior features, respectively. Then, taking the encoded features of previous module as input, a scenario-specific linear transformation layer is adopted to further extract scenario-specific features, followed by two groups of debias expert networks, i.e., scenario-specific experts and scenario-shared experts. They output intermediate results independently, which are further fused into the final result by a multi-scenario gating module. In addition, to mitigate the data fairness issue caused by manual intervention, we propose the concept of Fairness Coefficient (FC) to measures the importance of individual sample and use it to reweigh the prediction in the debias expert networks. Experiments on an offline dataset covering over 80 million users and 1.55 million travel items and an online A/B test demonstrate the effectiveness of our SAR-Net and its superiority over state-of-the-art methods.
翻译:阿里巴巴的旅行营销平台对于来自弗莱吉、道宝、阿里帕伊应用程序等数百种不同的旅行情景发挥了不可或缺的作用。 为了向访问不同情景的用户提供个性化建议服务,有两个关键问题需要认真处理。首先,由于不同情景的交通特点,因此很难训练一个统一的模型来为所有人服务。第二,在升级期间,由于人工干预,某些特定物项的暴露将因手工干预而重新加权,从而导致有偏差的日志,这将降低利用这些偏差数据培训的排名模式。在本文件中,我们提议建立一个新的情景-软件分级网络(SAR-Net)来解决这些问题。SAR-Net通过两个特定的关注模块学习用户的跨情景利益,从不同情景中获取大量的数据,这些模块分别利用情景特征和项目特性来调整用户的行为行为特征。 之后,以前一个模块的编码特征作为投入,采用一个特定情景-线性转换层来进一步提取情景-具体模式的特征,然后由两组专家网络(即SAR-Net)来解决这些问题。SAR-Net从不同的情景分级网络获取大量数据,通过两个不同情景-具体数据,然后由我们分别提出一个测试结果,然后用一个中间数据,然后用一个测试结果,然后用一个测试结果,然后用一个测试数据,然后用一个模型-直向在线数据显示。