The IMPRESSIONS section of a radiology report about an imaging study is a summary of the radiologist's reasoning and conclusions, and it also aids the referring physician in confirming or excluding certain diagnoses. A cascade of tasks are required to automatically generate an abstractive summary of the typical information-rich radiology report. These tasks include acquisition of salient content from the report and generation of a concise, easily consumable IMPRESSIONS section. Prior research on radiology report summarization has focused on single-step end-to-end models -- which subsume the task of salient content acquisition. To fully explore the cascade structure and explainability of radiology report summarization, we introduce two innovations. First, we design a two-step approach: extractive summarization followed by abstractive summarization. Second, we additionally break down the extractive part into two independent tasks: extraction of salient (1) sentences and (2) keywords. Experiments on English radiology reports from two clinical sites show our novel approach leads to a more precise summary compared to single-step and to two-step-with-single-extractive-process baselines with an overall improvement in F1 score Of 3-4%.
翻译:有关成像研究的放射学报告中的压缩部分是放射学家推理和结论的概要,它也帮助提供参考的医生确认或排除某些诊断。需要一系列任务来自动生成典型信息丰富放射学报告的抽象摘要。这些任务包括从报告中获取突出内容,并制作一个简明、易于归纳的《压力》部分。以前关于放射学报告总结的研究侧重于单步端到端模型,这包含了突出内容获取的任务。为了充分探索阶梯结构和放射学报告总结的可解释性,我们引入了两个创新。首先,我们设计了两步制方法:提取总结,然后抽象地总结。第二,我们又将抽取部分分为两个独立任务:提取突出(1)句和(2)关键词。两个临床网站对英文放射学报告的实验表明,我们的新做法导致与单步和两步制的扩展过程基线相比,与F1-31分级总体改进的F3-4分位基准。