User behavior data produced during interaction with massive items in the significant data era are generally heterogeneous and sparse, leaving the recommender system (RS) a large diversity of underlying patterns to excavate. Deep neural network-based models have reached the state-of-the-art benchmark of the RS owing to their fitting capabilities. However, prior works mainly focus on designing an intricate architecture with fixed loss function and regulation. These single-metric models provide limited performance when facing heterogeneous and sparse user behavior data. Motivated by this finding, we propose a multi-metric AutoRec (MMA) based on the representative AutoRec. The idea of the proposed MMA is mainly two-fold: 1) apply different $L_p$-norm on loss function and regularization to form different variant models in different metric spaces, and 2) aggregate these variant models. Thus, the proposed MMA enjoys the multi-metric orientation from a set of dispersed metric spaces, achieving a comprehensive representation of user data. Theoretical studies proved that the proposed MMA could attain performance improvement. The extensive experiment on five real-world datasets proves that MMA can outperform seven other state-of-the-art models in predicting unobserved user behavior data.
翻译:在与重要数据时代的大型项目互动期间产生的用户行为数据一般是多种多样和稀少的,因此建议者系统(RS)有多种多样的要挖掘的基本模式。深神经网络模型由于其安装能力,已经达到RS的最先进的基准。然而,先前的工作主要侧重于设计一个具有固定损失功能和规范的复杂结构。这些单数模型在面对不同和分散用户行为数据时提供有限的性能。根据这一发现,我们提议以代表AutoRec为基础的多度自动Rec(MMA)。提议的MMA概念主要有两个方面:1)在损失功能上应用不同的L_p$-norm,并规范在不同指标空间形成不同的变式模型,2)汇总这些变式模型。因此,拟议的MMA从一套分散的计量空间中遵循了多度方向,实现了用户数据的全面表述。理论研究证明,拟议的MMA可以实现性能改进。关于五个真实世界数据集的广泛实验证明,MMA在预测其他7个状态的用户行为模型中可以超越其他不观测的用户行为模型。