This paper introduces Summary Explorer, a new tool to support the manual inspection of text summarization systems. It compiles the outputs of 55 state-of-the-art single-document summarization approaches on three benchmark datasets and lets users explore them visually during a qualitative assessment. The underlying design of the tool considers three well-known summary quality criteria (coverage, faithfulness, and position bias), encapsulated in a guided assessment based on tailored visualizations. The tool complements existing approaches for locally debugging summarization models and improves upon them. The tool is available at https://tldr.webis.de/