This paper presents the first multi-objective transformer model for constructing open cloze tests that exploits generation and discrimination capabilities to improve performance. Our model is further enhanced by tweaking its loss function and applying a post-processing re-ranking algorithm that improves overall test structure. Experiments using automatic and human evaluation show that our approach can achieve up to 82% accuracy according to experts, outperforming previous work and baselines. We also release a collection of high-quality open cloze tests along with sample system output and human annotations that can serve as a future benchmark.
翻译:本文介绍了第一种多目标变压器模型,用于建造利用生成和区分能力提高性能的露天凝块测试;通过调整其损失功能和采用后处理后再排序算法,改进总体测试结构,进一步强化了我们的模型;使用自动和人力评估的实验表明,根据专家,我们的方法可以达到82%的精确度,超过以往的工作和基线;我们还公布了一系列高质量的露天凝块测试,以及抽样系统输出和人文说明,可作为今后的基准。