Pretrained language models have significantly improved the performance of downstream language understanding tasks, including extractive question answering, by providing high-quality contextualized word embeddings. However, training question answering models still requires large-scale data annotation in specific domains. In this work, we propose REGEX, a cooperative, self-play learning framework for question generation and answering. REGEX is built upon a masked answer extraction task with an interactive learning environment containing an answer entity REcognizer, a question Generator, and an answer EXtractor. Given a passage with a masked entity, the generator produces a question about the entity, and the extractor is trained to recover the masked entity from the raw text given the generated question. The framework allows question generation and answering models to be trained on any text corpus without annotation. We further leverage a reinforcement learning technique to reward the generation of high-quality questions and to improve the answer extraction model's performance. Experimental results show that REGEX outperforms state-of-the-art (SOTA) pretrained language models and zero-shot approaches on standard question answering benchmarks, and achieves new SOTA performance in the zero-shot setting.
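To make the recognizer–generator–extractor loop concrete, the following is a minimal sketch of one self-play step, using off-the-shelf Hugging Face pipelines as stand-ins for the three components. The specific model choices, the "generate question:" prompt prefix, and the exact-match reward are illustrative assumptions, not the authors' implementation.

```python
# A minimal sketch of one REGEX self-play step (assumed stand-in models).
from transformers import pipeline

recognizer = pipeline("ner", aggregation_strategy="simple")      # answer entity REcognizer
generator  = pipeline("text2text-generation", model="t5-small")  # question Generator
extractor  = pipeline("question-answering")                      # answer EXtractor

passage = "Marie Curie won the Nobel Prize in Physics in 1903."

# 1. Recognize candidate answer entities in the raw passage.
entities = recognizer(passage)

for ent in entities:
    answer = ent["word"]

    # 2. Mask the entity and ask the generator for a question about it.
    #    (The task prefix below is an assumption for illustration.)
    masked = passage.replace(answer, "<mask>")
    question = generator(f"generate question: {masked}")[0]["generated_text"]

    # 3. The extractor answers the generated question against the raw text.
    prediction = extractor(question=question, context=passage)

    # 4. Exact-match reward (an assumed stand-in for the paper's RL reward);
    #    in training, this signal would update the generator's policy and
    #    the extracted spans would supervise the extractor.
    reward = float(prediction["answer"].strip() == answer.strip())
    print(question, "->", prediction["answer"], "| reward:", reward)
```

Because the loop needs only raw text and a masking step, it illustrates how the framework can, in principle, manufacture question-answer training pairs from any unannotated corpus.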