以多重选择问题为基础的多方交流政策学习促进相互建议 (Multiple Choice Questions based Multi-Interest Policy Learning for Conversational Recommendation)

Conversational recommendation system (CRS) is able to obtain fine-grained and dynamic user preferences based on interactive dialogue. Previous CRS assumes that the user has a clear target item. However, for many users who resort to CRS, they might not have a clear idea about what they really like. Specifically, the user may have a clear single preference for some attribute types (e.g. color) of items, while for other attribute types, the user may have multiple preferences or even no clear preferences, which leads to multiple acceptable attribute instances (e.g. black and red) of one attribute type. Therefore, the users could show their preferences over items under multiple combinations of attribute instances rather than a single item with unique combination of all attribute instances. As a result, we first propose a more realistic CRS learning setting, namely Multi-Interest Multi-round Conversational Recommendation, where users may have multiple interests in attribute instance combinations and accept multiple items with partially overlapped combinations of attribute instances. To effectively cope with the new CRS learning setting, in this paper, we propose a novel learning framework namely, Multi-Choice questions based Multi-Interest Policy Learning . In order to obtain user preferences more efficiently, the agent generates multi-choice questions rather than binary yes/no ones on specific attribute instance. Besides, we propose a union set strategy to select candidate items instead of existing intersection set strategy in order to overcome over-filtering items during the conversation. Finally, we design a Multi-Interest Policy Learning module, which utilizes captured multiple interests of the user to decide next action, either asking attribute instances or recommending items. Extensive experimental results on four datasets verify the superiority of our method for the proposed setting.

翻译： conversation 推荐系统( CRS) 能够在互动对话的基础上获得精细的和动态的用户偏好。以前的 CRS 假设用户拥有一个清晰的目标项。但是, 许多使用 CRS 的用户可能无法清楚地了解他们真正喜欢什么。具体地说, 用户可能对某些属性类型( 如颜色) 有着明确的单一偏好, 而对于其他属性类型, 用户可能有多重偏好, 甚至没有明确的偏好, 从而导致一个属性类型的多重可接受属性实例( 如黑红) 。因此, 用户可以显示他们对多个属性选项的偏好。因此, 对于许多使用 CRS 的用户来说, 他们可能没有明确的目标项。用户可以显示他们对多个属性的选项的偏好, 而不是一个单一的项目。因此, 我们首先提议一个更现实的 CRS 学习框架, 即多功能化的选项, 而不是多功能化的选项。我们提议一个新的 CRS 学习框架, 以多功能化的选项, 而不是多功能化的选项。