A recommender system aims to recommend items that a user is interested in among many items. The need for the recommender system has been expanded by the information explosion. Various approaches have been suggested for providing meaningful recommendations to users. One of the proposed approaches is to consider a recommender system as a Markov decision process (MDP) problem and try to solve it using reinforcement learning (RL). However, existing RL-based methods have an obvious drawback. To solve an MDP in a recommender system, they encountered a problem with the large number of discrete actions that bring RL to a larger class of problems. In this paper, we propose a novel RL-based recommender system. We formulate a recommender system as a gridworld game by using a biclustering technique that can reduce the state and action space significantly. Using biclustering not only reduces space but also improves the recommendation quality effectively handling the cold-start problem. In addition, our approach can provide users with some explanation why the system recommends certain items. Lastly, we examine the proposed algorithm on a real-world dataset and achieve a better performance than the widely used recommendation algorithm.
翻译:推荐人系统旨在推荐用户对许多项目感兴趣的项目; 信息爆炸扩大了对推荐人系统的需求; 提出了向用户提供有意义建议的各种办法; 提议的办法之一是将推荐人系统视为Markov决策程序(MDP)的问题,并试图利用强化学习(RL)解决该问题。 但是,基于RL的现有方法显然有一个明显的缺陷。 为了在推荐人系统中解决一个 MDP,他们遇到一个问题,因为有大量的离散行动,使RL陷入了更大的问题类别。 在本文中,我们提出了一个新的基于RL的建议系统。我们用双组技术将推荐人系统设计成一个网格世界游戏,这样可以大大减少状态和行动空间。使用双组组合不仅减少空间,而且提高建议的质量,有效地处理冷启动问题。 此外,我们的方法可以向用户提供某种解释,说明为什么该系统建议某些项目。 最后,我们研究了一个基于真实世界数据集的拟议算法,并取得了比广泛使用的建议算法更好的性。