Multiple choice questions (MCQs) are widely used in digital learning systems, as they allow for automating the assessment process. However, due to the increased digital literacy of students and the advent of social media platforms, MCQ tests are widely shared online, and teachers are continuously challenged to create new questions, which is an expensive and time-consuming task. A particularly sensitive aspect of MCQ creation is devising relevant distractors, i.e., wrong answers that are not easily identifiable as being wrong. This paper studies how a large existing set of manually created answers and distractors for questions over a variety of domains, subjects, and languages can be leveraged to help teachers in creating new MCQs, through the smart reuse of existing distractors. We built several data-driven models based on context-aware question and distractor representations, and compared them with static feature-based models. The proposed models are evaluated with automated metrics and in a realistic user test with teachers. Both automatic and human evaluations indicate that context-aware models consistently outperform the static feature-based approach. For our best-performing context-aware model, on average 3 of the 10 distractors shown to teachers were rated as high-quality distractors. We create a performance benchmark and make it public, to enable comparison between different approaches and to introduce a more standardized evaluation of the task. The benchmark contains a test set of 298 educational questions covering multiple subjects and languages, as well as a multilingual pool of 77k distractor vocabulary items for future research.
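To make the idea of distractor reuse concrete, the sketch below ranks candidate distractors from an existing pool by embedding similarity between the candidates and the new question plus its correct answer. This is a minimal illustration only: the encoder, the model name, and the toy pool are assumptions for demonstration, not the context-aware models or data described in the paper.

```python
# Minimal sketch of context-aware distractor reuse: rank candidate
# distractors from an existing pool by embedding similarity to a new
# question. Encoder choice and example pool are illustrative assumptions.
import numpy as np
from sentence_transformers import SentenceTransformer

# Assumed multilingual sentence encoder (not necessarily the paper's model).
encoder = SentenceTransformer("paraphrase-multilingual-MiniLM-L12-v2")

def rank_distractors(question: str, correct_answer: str,
                     pool: list[str], top_k: int = 10) -> list[str]:
    """Return the top_k pool entries most similar to the question context."""
    # Encode the question together with its correct answer as the "context".
    context_vec = encoder.encode([f"{question} {correct_answer}"])[0]
    pool_vecs = encoder.encode(pool)
    # Cosine similarity between the context and every candidate distractor.
    sims = pool_vecs @ context_vec / (
        np.linalg.norm(pool_vecs, axis=1) * np.linalg.norm(context_vec) + 1e-9
    )
    ranked = np.argsort(-sims)
    # Drop candidates identical to the correct answer before returning.
    return [pool[i] for i in ranked if pool[i] != correct_answer][:top_k]

# Hypothetical usage: suggest distractors for a new geography question.
pool = ["Paris", "Berlin", "Madrid", "Mitochondrion", "Photosynthesis", "Rome"]
print(rank_distractors("What is the capital of Italy?", "Rome", pool, top_k=3))
```

In such a setup, semantically related entries (e.g., other European capitals) would tend to rank above unrelated ones, which is the intuition behind reusing an existing distractor pool rather than authoring distractors from scratch.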