A number of models for neural content-based news recommendation have been proposed. However, there is limited understanding of the relative importances of the three main components of such systems (news encoder, user encoder, and scoring function) and the trade-offs involved. In this paper, we assess the hypothesis that the most widely used means of matching user and candidate news representations is not expressive enough. We allow our system to model more complex relations between the two by assessing more expressive scoring functions. Across a wide range of baseline and established systems this results in consistent improvements of around 6 points in AUC. Our results also indicate a trade-off between the complexity of news encoder and scoring function: A fairly simple baseline model scores well above 68% AUC on the MIND dataset and comes within 2 points of the published state-of-the-art, while requiring a fraction of the computational costs.
翻译:提出了若干基于神经内容的新闻建议模式,但对这种系统三个主要组成部分(新编码器、用户编码器和评分功能)的相对重要性和所涉权衡的相对重要性了解有限。在本文件中,我们评估了一个假设,即最广泛使用的匹配用户和候选新闻表述方式的方法不够清晰。我们允许我们的系统通过评估更直观的评分功能来建模两者之间更为复杂的关系。在一系列广泛的基线和既定系统中,这导致AUC中大约6点的一致改进。我们的结果还表明,新闻编码器和评分功能的复杂程度是相互权衡的:在MIND数据集中,一个相当简单的基线模型得分远远超过68%的ACU,在所公布的最先进的两个点之内,同时需要一定的计算成本。