以多目标为导向的强化学习指导性多目的试读纸制文件制作</s> (Reinforcement Learning Guided Multi-Objective Exam Paper Generation)

To reduce the repetitive and complex work of instructors, exam paper generation (EPG) technique has become a salient topic in the intelligent education field, which targets at generating high-quality exam paper automatically according to instructor-specified assessment criteria. The current advances utilize the ability of heuristic algorithms to optimize several well-known objective constraints, such as difficulty degree, number of questions, etc., for producing optimal solutions. However, in real scenarios, considering other equally relevant objectives (e.g., distribution of exam scores, skill coverage) is extremely important. Besides, how to develop an automatic multi-objective solution that finds an optimal subset of questions from a huge search space of large-sized question datasets and thus composes a high-quality exam paper is urgent but non-trivial. To this end, we skillfully design a reinforcement learning guided Multi-Objective Exam Paper Generation framework, termed MOEPG, to simultaneously optimize three exam domain-specific objectives including difficulty degree, distribution of exam scores, and skill coverage. Specifically, to accurately measure the skill proficiency of the examinee group, we first employ deep knowledge tracing to model the interaction information between examinees and response logs. We then design the flexible Exam Q-Network, a function approximator, which automatically selects the appropriate question to update the exam paper composition process. Later, MOEPG divides the decision space into multiple subspaces to better guide the updated direction of the exam paper. Through extensive experiments on two real-world datasets, we demonstrate that MOEPG is feasible in addressing the multiple dilemmas of exam paper generation scenario.

翻译：为了减少教员的重复和复杂工作,考试论文制作技术已成为智能教育领域的一个突出主题,该技术的目标是根据教员指定的评估标准自动生成高质量的考试论文。目前的进展利用了休眠算法的能力优化若干众所周知的客观制约因素,如难度、问题数量等,以产生最佳解决办法。然而,在现实情况下,考虑到其他同等相关的目标(例如考试分数的分布、技能覆盖面),这已成为智能教育领域的一个突出主题。此外,如何开发一个自动的多目标解决方案,从大型问题数据集的庞大搜索空间中找到最佳的一组问题,从而形成高质量的考试文件,这是紧迫的,但非三重性。为此,我们巧妙地设计了一个强化学习指导多动感动Examp造纸框架,称为MOEPG,同时优化三个考试特定领域的目标,包括困难程度、考试分数分布和技能覆盖面。具体地说,为了准确衡量当时的审查组的技能熟练程度,我们首先利用深入的知识追踪,然后在网络的大规模搜索中进行互动信息模型,然后是自动地更新Exdeal Excal Excial oral Produal ex a laction the the subal laction the subal ex askal askal lax the the lax the drobal access access access access access the the labildaldaldaldal abaldaldaldaldaldal abildaldaldaldaldaldaldal abild the.</s>