This paper presents Summary Workbench, a new tool for developing and evaluating text summarization models. New models and evaluation measures can be easily integrated as Docker-based plugins, allowing users to examine the quality of the models' summaries on any input and to evaluate them with a variety of measures. Visual analyses combining multiple measures provide insights into the models' strengths and weaknesses. The tool is hosted at \url{https://tldr.demo.webis.de} and also supports local deployment for private resources.