Recent advancements of sequential deep learning models such as Transformer and BERT have significantly facilitated the sequential recommendation. However, according to our study, the distribution of item embeddings generated by these models tends to degenerate into an anisotropic shape, which may result in high semantic similarities among embeddings. In this paper, both empirical and theoretical investigations of this representation degeneration problem are first provided, based on which a novel recommender model DuoRec is proposed to improve the item embeddings distribution. Specifically, in light of the uniformity property of contrastive learning, a contrastive regularization is designed for DuoRec to reshape the distribution of sequence representations. Given the convention that the recommendation task is performed by measuring the similarity between sequence representations and item embeddings in the same space via dot product, the regularization can be implicitly applied to the item embedding distribution. Existing contrastive learning methods mainly rely on data level augmentation for user-item interaction sequences through item cropping, masking, or reordering and can hardly provide semantically consistent augmentation samples. In DuoRec, a model-level augmentation is proposed based on Dropout to enable better semantic preserving. Furthermore, a novel sampling strategy is developed, where sequences having the same target item are chosen hard positive samples. Extensive experiments conducted on five datasets demonstrate the superior performance of the proposed DuoRec model compared with baseline methods. Visualization results of the learned representations validate that DuoRec can largely alleviate the representation degeneration problem.
翻译:根据我们的研究,这些模型产生的嵌入物的分布往往会演变成厌食形状,这可能导致嵌入物之间在语义上的高度相似性。在本文件中,首先提供对这一代表物变异问题的实验和理论调查,并在此基础上提出一个新的建议模型DuoRec,以改进项目嵌入分布。具体地说,鉴于对比性学习的统一性能,DuoRec为DuoRec设计了对比性正规化,以重新塑造序列表达物的分布。鉴于这个公约,建议任务是通过测量序列表达物与通过点产品嵌入物嵌入物在同一空间的嵌入物之间的相似性来完成的。在嵌入分布方面,可以隐含地应用这种正规化,现有的对比性学习方法主要依靠数据水平增强用户-项目互动序列,通过项目裁成模型、遮掩、或重新排序,很难提供自定义性连贯的增强样本。在DuoRec中,一个模型级递增层层次的递增主要是为了测量结果,然后在丢弃式的基底级模型上提出一个更精确的缩缩缩缩的模型,以演示方法进行。