Medical imaging is frequently used in clinical practice and trials for diagnosis and treatment. Writing imaging reports is time-consuming and can be error-prone for inexperienced radiologists. Therefore, automatically generating radiology reports is highly desired to lighten the workload of radiologists and accordingly promote clinical automation, which is an essential task to apply artificial intelligence to the medical domain. In this paper, we propose to generate radiology reports with memory-driven Transformer, where a relational memory is designed to record key information of the generation process and a memory-driven conditional layer normalization is applied to incorporating the memory into the decoder of Transformer. Experimental results on two prevailing radiology report datasets, IU X-Ray and MIMIC-CXR, show that our proposed approach outperforms previous models with respect to both language generation metrics and clinical evaluations. Particularly, this is the first work reporting the generation results on MIMIC-CXR to the best of our knowledge. Further analyses also demonstrate that our approach is able to generate long reports with necessary medical terms as well as meaningful image-text attention mappings.
翻译:医学成像经常用于临床实践和诊断和治疗试验。写成成像报告耗费时间,对缺乏经验的放射学家来说容易出错。因此,自动生成放射学报告非常希望减轻放射学家的工作量,并因此促进临床自动化,这是将人工智能应用于医疗领域的一项基本任务。在本文中,我们提议用记忆驱动变异器制作放射学报告,其中设计了一种关系记忆记忆,以记录生成过程的关键信息,并应用了记忆驱动的有条件层正常化,将记忆纳入变异器的解码器中。两个流行的放射学报告数据集(IU-X光和MIMIC-CXR)的实验结果显示,我们所提议的方法在语言生成指标和临床评估方面都超越了以前的模式。特别是,这是首次报告MIMIC-CXR生成结果的工作,以最佳的知识为基础。进一步的分析还表明,我们的方法能够产生长的报告,并附有必要的医学术语和有意义的图像文字关注图象。