Bundle recommender systems recommend sets of items (e.g., pants, shirt, and shoes) to users, but they often suffer from two issues: significant interaction sparsity and a large output space. In this work, we extend multi-round conversational recommendation (MCR) to alleviate these issues. MCR, which uses a conversational paradigm to elicit user interests by asking user preferences on tags (e.g., categories or attributes) and handling user feedback across multiple rounds, is an emerging recommendation setting to acquire user feedback and narrow down the output space, but has not been explored in the context of bundle recommendation. In this work, we propose a novel recommendation task named Bundle MCR. We first propose a new framework to formulate Bundle MCR as Markov Decision Processes (MDPs) with multiple agents, for user modeling, consultation and feedback handling in bundle contexts. Under this framework, we propose a model architecture, called Bundle Bert (Bunt) to (1) recommend items, (2) post questions and (3) manage conversations based on bundle-aware conversation states. Moreover, to train Bunt effectively, we propose a two-stage training strategy. In an offline pre-training stage, Bunt is trained using multiple cloze tasks to mimic bundle interactions in conversations. Then in an online fine-tuning stage, Bunt agents are enhanced by user interactions. Our experiments on multiple offline datasets as well as the human evaluation show the value of extending MCR frameworks to bundle settings and the effectiveness of our Bunt design.
翻译:Bundle 推荐系统向用户推荐成套项目(如裤子、衬衫和鞋),但它们往往受到两个问题的影响:显著的互动宽度和巨大的输出空间。在这项工作中,我们推广了多轮对话建议(MCR)来缓解这些问题。 MCR使用一个谈话模式来吸引用户兴趣,询问用户对标签(如类别或属性)的偏好,并处理多个回合的用户反馈,这是一个新出现的建议设置,以获取用户反馈,缩小产出环境,但在汇总建议中尚未探讨。在这项工作中,我们提出了名为Bundle MCR 的新建议任务。我们首先提出了一个新的框架,以制定Bundle MCR 作为Markov 决策进程(MCR),与多个代理商一起,用于用户建模、磋商和反馈处理。在这个框架内,我们提出了一个模型架构,叫Bundle Bert (Buntle Bert), 以建议项目、 后线问题和(3) 管理基于捆绑式对话状态的对话。此外,为了有效地培训Buntle,我们提出了名为Bundlead MRC 的双级互动战略,我们建议用双阶段的Bent 将多阶段培训工具在Buntal Buntreal Binal 上展示一个升级的服务器上, 。