Finding online research papers relevant to one's interests is very challenging due to the increasing number of publications. Therefore, personalized research paper recommendation has become a significant and timely research topic. Collaborative filtering is a successful recommendation approach, which exploits the ratings given to items by users as a source of information for learning to make accurate recommendations. However, the ratings are often very sparse as in the research paper domain, due to the huge number of publications growing every year. Therefore, more attention has been drawn to hybrid methods that consider both ratings and content information. Nevertheless, most of the hybrid recommendation approaches that are based on text embedding have utilized bag-of-words techniques, which ignore word order and semantic meaning. In this paper, we propose a hybrid approach that leverages deep semantic representation of research papers based on social tags assigned by users. The experimental evaluation is performed on CiteULike, a real and publicly available dataset. The obtained findings show that the proposed model is effective in recommending research papers even when the rating data is very sparse.
翻译:由于出版物数量不断增加,寻找与个人利益有关的在线研究论文非常困难,因此,个人化研究论文建议已成为重要和及时的研究专题。合作过滤是一种成功的推荐方法,利用用户对项目给予的评级作为学习的信息来源,以得出准确的建议。然而,由于每年增加大量出版物,评级往往与研究论文领域一样非常稀少。因此,更多地注意既考虑评级又考虑内容信息的混合方法。然而,基于文本嵌入的混合推荐方法大多使用了字包技术,忽视了字顺序和语义含义。在本文件中,我们提出一种混合方法,利用基于用户所分配的社会标记的研究论文的深度语义代表性。实验性评价是在一个真实和公开的数据集CiteUet进行。获得的研究结果表明,即使评级数据非常稀少,拟议的模型在推荐研究论文方面也是有效的。