Recently, sequential recommendation has emerged as a widely studied topic. Existing researches mainly design effective neural architectures to model user behavior sequences based on item IDs. However, this kind of approach highly relies on user-item interaction data and neglects the attribute- or characteristic-level correlations among similar items preferred by a user. In light of these issues, we propose IDA-SR, which stands for ID-Agnostic User Behavior Pre-training approach for Sequential Recommendation. Instead of explicitly learning representations for item IDs, IDA-SR directly learns item representations from rich text information. To bridge the gap between text semantics and sequential user behaviors, we utilize the pre-trained language model as text encoder, and conduct a pre-training architecture on the sequential user behaviors. In this way, item text can be directly utilized for sequential recommendation without relying on item IDs. Extensive experiments show that the proposed approach can achieve comparable results when only using ID-agnostic item representations, and performs better than baselines by a large margin when fine-tuned with ID information.
翻译:最近,相继建议作为一个广泛研究的专题出现。现有的研究主要是设计有效的神经结构,以根据项目ID来模拟用户行为序列。然而,这种方法高度依赖用户项目互动数据,忽视用户所偏爱的类似项目之间的属性或特征相关性。鉴于这些问题,我们提议开发协会-SR(IDA-Agnotistic用户行为)为序列建议培训前办法。开发协会-SR(IDA-SR)不是对项目ID进行明确学习,而是直接从丰富的文本信息中学习项目表述。为了弥合文本语义学和顺序用户行为之间的差距,我们使用预先培训的语言模式作为文本编码器,并进行关于顺序用户行为的培训前结构。这样,项目文本可以直接用于顺序建议,而不必依赖项目标识。广泛的实验表明,拟议的方法只有在只使用ID-Annocial项目表述时才能取得可比较的结果,并且在与ID信息进行微调时比基线要好得多。