Recommender systems have been widely applied in different real-life scenarios to help us find useful information. Recently, Reinforcement Learning (RL) based recommender systems have become an emerging research topic. It often surpasses traditional recommendation models even most deep learning-based methods, owing to its interactive nature and autonomous learning ability. Nevertheless, there are various challenges of RL when applying in recommender systems. Toward this end, we firstly provide a thorough overview, comparisons, and summarization of RL approaches for five typical recommendation scenarios, following three main categories of RL: value-function, policy search, and Actor-Critic. Then, we systematically analyze the challenges and relevant solutions on the basis of existing literature. Finally, under discussion for open issues of RL and its limitations of recommendation, we highlight some potential research directions in this field.
翻译:为了帮助我们找到有用的信息,在不同的现实情景中广泛应用了建议系统,最近,基于强化学习(RL)的推荐系统已成为一个新的研究课题,由于它的互动性质和自主学习能力,它往往超过传统的推荐模式,甚至超越最深的基于学习的方法。然而,在推荐系统应用时,建议系统面临各种挑战。为此目的,我们首先对五种典型的建议情景,即价值功能、政策搜索和Acor-Cricit等三大类建议情景的RL方法进行透彻的概述、比较和总结。然后,我们根据现有文献系统分析挑战和相关解决方案。最后,在讨论RL的开放问题及其建议局限性时,我们强调该领域的一些潜在研究方向。