While sequential recommender systems achieve significant improvements on capturing user dynamics, we argue that sequential recommenders are vulnerable against substitution-based profile pollution attacks. To demonstrate our hypothesis, we propose a substitution-based adversarial attack algorithm, which modifies the input sequence by selecting certain vulnerable elements and substituting them with adversarial items. In both untargeted and targeted attack scenarios, we observe significant performance deterioration using the proposed profile pollution algorithm. Motivated by such observations, we design an efficient adversarial defense method called Dirichlet neighborhood sampling. Specifically, we sample item embeddings from a convex hull constructed by multi-hop neighbors to replace the original items in input sequences. During sampling, a Dirichlet distribution is used to approximate the probability distribution in the neighborhood such that the recommender learns to combat local perturbations. Additionally, we design an adversarial training method tailored for sequential recommender systems. In particular, we represent selected items with one-hot encodings and perform gradient ascent on the encodings to search for the worst case linear combination of item embeddings in training. As such, the embedding function learns robust item representations and the trained recommender is resistant to test-time adversarial examples. Extensive experiments show the effectiveness of both our attack and defense methods, which consistently outperform baselines by a significant margin across model architectures and datasets.
翻译:虽然顺序建议系统在捕捉用户动态方面有显著改进,但我们认为,顺序建议者在捕捉用户动态方面容易受替代性剖面污染攻击袭击的影响。为了证明我们的假设,我们建议采用基于替代的对抗性攻击算法,通过选择某些脆弱元素来修改输入序列,并以对抗性攻击物品取代它们。在非针对性和有针对性的攻击假设中,我们观察到使用拟议的剖面污染算法导致的显著性能恶化。我们受这种观察的驱动,我们设计了一种高效的对抗性防御方法,称为Drichlet邻居取样。具体地说,我们抽样从由多霍邻居建造的康韦克斯船体嵌入项目,以取代输入序列中的原始物品。在取样期间,Drichlet的分布法用于估计附近地区概率分布,以便让建议者学会打击当地扰动性扰动。此外,我们还设计了一种针对相近性建议系统设计的对抗性训练方法。我们代表了带有一极编码的选定项目,并在编码上显示在搜索培训中嵌入项目最坏的线性组合。因此,嵌入式项目功能学习了稳健的样的物品,Drichletletletlett 分布分布用于测试模型的模型的模型显示和深基底基建模型的模型,以显示我们测试模型的模型的模型显示的模型的模型的模型的模型的模型显示,并显示的模型显示的模型显示的模型显示的基底基底压式模型显示。