End-to-end training has been a popular approach for knowledge base question answering (KBQA). However, real world applications often contain answers of varied quality for users' questions. It is not appropriate to treat all available answers of a user question equally. This paper proposes a novel approach based on multiple instance learning to address the problem of noisy answers by exploring consensus among answers to the same question in training end-to-end KBQA models. In particular, the QA pairs are organized into bags with dynamic instance selection and different options of instance weighting. Curriculum learning is utilized to select instance bags during training. On the public CQA dataset, the new method significantly improves both entity accuracy and the Rouge-L score over a state-of-the-art end-to-end KBQA baseline.
翻译:端对端培训是知识基础问题解答的流行方法(KBQA),然而,现实世界应用中往往包含不同质量的用户问题解答,不宜一视同仁地对待用户问题的所有现有解答,本文件提出基于多实例学习的新颖方法,通过探索对培训端对端KBQA模式中同一问题解答的共识,解决噪音问题。特别是,质量A对子被组织成袋,有动态实例选择和不同的实例加权选项。课程学习用于在培训中选择实例包。在公共的 CQA数据集中,新方法极大地提高了实体准确性和红色-L在最先进的端对端KBQA基线上的分数。