Modeling users' dynamic preferences from historical behaviors lies at the core of modern recommender systems. Due to the diverse nature of user interests, recent advances propose the multi-interest networks to encode historical behaviors into multiple interest vectors. In real scenarios, the corresponding items of captured interests are usually retrieved together to get exposure and collected into training data, which produces dependencies among interests. Unfortunately, multi-interest networks may incorrectly concentrate on subtle dependencies among captured interests. Misled by these dependencies, the spurious correlations between irrelevant interests and targets are captured, resulting in the instability of prediction results when training and test distributions do not match. In this paper, we introduce the widely used Hilbert-Schmidt Independence Criterion (HSIC) to measure the degree of independence among captured interests and empirically show that the continuous increase of HSIC may harm model performance. Based on this, we propose a novel multi-interest network, named DEep Stable Multi-Interest Learning (DESMIL), which tries to eliminate the influence of subtle dependencies among captured interests via learning weights for training samples and make model concentrate more on underlying true causation. We conduct extensive experiments on public recommendation datasets, a large-scale industrial dataset and the synthetic datasets which simulate the out-of-distribution data. Experimental results demonstrate that our proposed DESMIL outperforms state-of-the-art models by a significant margin. Besides, we also conduct comprehensive model analysis to reveal the reason why DESMIL works to a certain extent.
翻译:由于用户兴趣的不同性质,最近的进展建议了多种利益网络,将历史行为编码成多种利益矢量。在现实情况下,被捕获利益的相应项目通常被一起检索,以获得接触,并收集成培训数据,从而产生利益之间的依赖性。不幸的是,多重利益网络可能错误地集中于被捕获利益之间的微妙依赖性。由于这些依赖性,不相干的利益和目标之间的虚假关联被捕捉到,导致在培训和测试分布不匹配时,预测结果的不稳定性。在本文件中,我们采用了广泛使用的Hilbert-Schmidt独立标准(HSIC),以衡量被捕获利益之间的独立程度,并从经验上表明,HSIC的持续增长可能损害模型性能。在此基础上,我们提议建立一个新的多利益网络,名为DEep Stable 多重利益学习模式(DEMIL),试图消除被捕获利益之间微妙的相互依赖性影响,通过为培训样品和测试分发分配分配量的模型,使模型重点更多地集中在基础性工业因果关系上。我们进行了广泛的ILA级数据实验,通过大规模数据展示模型模拟数据,从而展示我们提出的模型数据。