As the body of research on machine narrative comprehension grows, there is a critical need for consideration of performance assessment strategies as well as the depth and scope of different benchmark tasks. Based on narrative theories, reading comprehension theories, as well as existing machine narrative reading comprehension tasks and datasets, we propose a typology that captures the main similarities and differences among assessment tasks; and discuss the implications of our typology for new task design and the challenges of narrative reading comprehension.
翻译:随着机器叙事理解研究的不断增多,迫切需要考虑业绩评估战略以及不同基准任务的深度和广度,根据叙事理论、理解理论以及现有的机器叙事理解任务和数据集,我们建议采用一种类型,捕捉各项评估任务的主要相似点和不同点;讨论我们类型对新任务设计的影响以及叙事理解的挑战。