With the recent advances of conversational recommendations, the recommender system is able to actively and dynamically elicit user preference via conversational interactions. To achieve this, the system periodically queries users' preference on attributes and collects their feedback. However, most existing conversational recommender systems only enable the user to provide absolute feedback to the attributes. In practice, the absolute feedback is usually limited, as the users tend to provide biased feedback when expressing the preference. Instead, the user is often more inclined to express comparative preferences, since user preferences are inherently relative. To enable users to provide comparative preferences during conversational interactions, we propose a novel comparison-based conversational recommender system. The relative feedback, though more practical, is not easy to be incorporated since its feedback scale is always mismatched with users' absolute preferences. With effectively collecting and understanding the relative feedback from an interactive manner, we further propose a new bandit algorithm, which we call RelativeConUCB. The experiments on both synthetic and real-world datasets validate the advantage of our proposed method, compared to the existing bandit algorithms in the conversational recommender systems.
翻译:由于最近对话建议的进展,建议系统能够积极和动态地通过对话互动来吸引用户偏好。为了实现这一点,系统定期询问用户对属性的偏好,并收集他们的反馈。然而,大多数现有的对话建议系统只使用户能够对属性提供绝对反馈。在实践中,绝对反馈通常有限,因为用户在表达偏好时往往提供有偏见的反馈。相反,用户往往更倾向于表达比较偏好,因为用户偏好本质上是相对的。为了让用户在对话互动期间提供比较偏好,我们提议了一个新的比较性对话建议系统。相对的反馈虽然比较实用,但不容易纳入,因为其反馈规模总是与用户的绝对偏好不匹配。通过有效收集和理解互动方式的相对反馈,我们进一步提议采用新的波段算法,我们称之为相对的CONUCB。关于合成和真实世界数据集的实验证实了我们拟议方法的优势,与对话建议系统中的现有的波段算法相比。