Sequential recommendation can capture user chronological preferences from their historical behaviors, yet the learning of short sequences (cold-start problem) in many benchmark datasets is still an open challenge. Recently, data augmentation with pseudo-prior items generated by Transformers has drawn considerable attention. These methods can generate pseudo-prior items sequentially in reverse chronological order to extend the original sequences. Nevertheless, the performance may still dramatically degrade in very short sequences; most notably, the generation of pseudo-prior items does not take into account the forward direction (from the past to the future), and so the underlying temporal correlations are not preserved in terms of conditional probabilities. Motivated by this, we propose a Bidirectional Chronological Augmentation of Transformer (BiCAT) that uses a forward learning constraint in the reverse generative process to capture contextual information more effectively. Then, self-knowledge distillation is adopted in augmented and original sequences to bridge the gap between data augmentation and model representation, which enhances the robustness of sequence encoder. Moreover, an informative positive and negative sampling strategy is proposed to accelerate optimization and prevent overfitting. Extensive experiments on two popular real-world datasets demonstrate the efficacy of our method on very short sequences (L < 3) and long sequences (20 < L < 50) as well, our approach outperforms state-of-the-art models by an average of 35.04% and 8.76% respectively, in terms of Recall@5. Source code is available at https://github.com/juyongjiang/BiCAT.
翻译:序列建议可以从历史行为中捕捉用户按时间顺序排列的偏好,然而在许多基准数据集中学习短顺序(从冷战开始的问题)仍是一个公开的挑战。 最近,由变换器产生的假主要项目的数据增强引起了相当的注意。 这些方法可以依次产生假主要项目, 依次按时间顺序排列, 以延长原始序列。 然而, 性能仍然会以非常短的顺序急剧下降; 最显著的是, 假主要项目的生成没有考虑到前向( 从过去到未来的), 因此, 基础时间相关关系不会以有条件的概率来保存。 受此驱动, 我们提议对变换器( BICAT ) 进行双向时间顺序递增数据限制, 以便更有效地获取背景信息。 之后, 以扩大和原始的顺序进行自我知识蒸馏, 弥合数据增强序列( 从过去到未来) 。 此外, 信息化的正反向和负时间相关性的抽样战略, 将加速精度 < Riversal com code 的缩缩缩缩缩缩缩校程。