Product descriptions on e-commerce websites often suffer from missing important aspects. Clarification question generation (CQGen) can be a promising approach to help alleviate the problem. Unlike traditional QGen assuming the existence of answers in the context and generating questions accordingly, CQGen mimics user behaviors of asking for unstated information. The generated CQs can serve as a sanity check or proofreading to help e-commerce merchant to identify potential missing information before advertising their product, and improve consumer experience consequently. Due to the variety of possible user backgrounds and use cases, the information need can be quite diverse but also specific to a detailed topic, while previous works assume generating one CQ per context and the results tend to be generic. We thus propose the task of Diverse CQGen and also tackle the challenge of specificity. We propose a new model named KPCNet, which generates CQs with Keyword Prediction and Conditioning, to deal with the tasks. Automatic and human evaluation on 2 datasets (Home & Kitchen, Office) showed that KPCNet can generate more specific questions and promote better group-level diversity than several competing baselines.
翻译:与传统QGen不同的是,CQGen假设在上下文中存在一个CQGen,并据此提出问题,CQGen模仿用户要求未说明的信息的行为。产生的CQQ可以作为一种理智检查或校对,帮助电子商务商人在公布产品之前查明潜在的缺失信息,从而改善消费者经验。由于用户背景和使用案例多种多样,信息可能非常多样,但具体到一个详细专题,而以前的工作假定每个背景产生一个CQ,结果往往比较笼统。我们因此提议多样化CQGen的任务,并处理特殊性的挑战。我们提出一个新的模式,即KPCNet,产生关键词预测和调控,处理任务。对2个数据集(Home & Kitchen, Office)的自动和人力评估表明,KPCNet(Home & Kitchen, Office)可以产生比几个竞争性基线更具体的问题,促进更高质量的群体多样性。