Conversational recommendation system (CRS) is emerging as a user-friendly way to capture users' dynamic preferences over candidate items and attributes. Multi-shot CRS is designed to make recommendations multiple times until the user either accepts the recommendation or leaves at the end of their patience. Existing works are trained with reinforcement learning (RL), which may suffer from unstable learning and prohibitively high demands for computing. In this work, we propose a simple and efficient CRS, MInimalist Non-reinforced Interactive COnversational Recommender Network (MINICORN). MINICORN models the epistemic uncertainty of the estimated user preference and queries the user for the attribute with the highest uncertainty. The system employs a simple network architecture and makes the query-vs-recommendation decision using a single rule. Somewhat surprisingly, this minimalist approach outperforms state-of-the-art RL methods on three real-world datasets by large margins. We hope that MINICORN will serve as a valuable baseline for future research.
翻译:多镜头 CRS旨在多次提出建议,直到用户接受建议或在其耐心结束时离开为止。现有工作经过强化学习培训(RL),这可能会受到不稳定的学习和过高的计算需求的影响。在这项工作中,我们提出了一个简单而高效的 CRS(MINICORN)网络(MINICORN)模型,估计用户偏好的缩略图不确定性,并用最不确定的属性询问用户。这个系统使用简单的网络结构,并使用单一规则作出查询-V建议决定。有些令人惊讶的是,这种最起码的方法使三大真实世界数据集的 RL 方法出现巨大利润。我们希望MIICORN将成为未来研究的宝贵基线。