While exam-style questions are a fundamental educational tool serving a variety of purposes, manual construction of questions is a complex process that requires training, experience and resources. Automatic question generation (QG) techniques can be utilized to satisfy the need for a continuous supply of new questions by streamlining their generation. However, compared to automatic question answering (QA), QG is a more challenging task. In this work, we fine-tune a multilingual T5 (mT5) transformer in a multi-task setting for QA, QG and answer extraction tasks using Turkish QA datasets. To the best of our knowledge, this is the first academic work that performs automated text-to-text question generation from Turkish texts. Experimental evaluations show that the proposed multi-task setting achieves state-of-the-art Turkish question answering and question generation performance on TQuADv1, TQuADv2 datasets and XQuAD Turkish split. The source code and the pre-trained models are available at https://github.com/obss/turkish-question-generation.
翻译:虽然考试式的问题是一个基本的教育工具,可以满足各种目的,但人工解决问题是一个复杂的过程,需要培训、经验和资源。自动问题生成技术可以通过简化其生成来满足持续提供新问题的需求。然而,与自动回答问题(QA)相比,QG是一项更具有挑战性的任务。在这项工作中,我们微调一个多任务环境中的多语种T5(mT5)变压器,用于QA、QG和用土耳其QA数据集回答提取任务。据我们所知,这是从土耳其文本中自动生成文本到文本问题的首次学术工作。实验性评估显示,拟议的多任务设置实现了土耳其语QuADv1、TQuADv2数据集和XQAD土耳其语分解的回答和问题生成最新表现。源码和预培训模型见https://github.com/obs/turkish-risk-seration。