Automatic radiology report generation is critical in clinics which can relieve experienced radiologists from the heavy workload and remind inexperienced radiologists of misdiagnosis or missed diagnose. Existing approaches mainly formulate radiology report generation as an image captioning task and adopt the encoder-decoder framework. However, in the medical domain, such pure data-driven approaches suffer from the following problems: 1) visual and textual bias problem; 2) lack of expert knowledge. In this paper, we propose a knowledge-enhanced radiology report generation approach introduces two types of medical knowledge: 1) General knowledge, which is input independent and provides the broad knowledge for report generation; 2) Specific knowledge, which is input dependent and provides the fine-grained knowledge for report generation. To fully utilize both the general and specific knowledge, we also propose a knowledge-enhanced multi-head attention mechanism. By merging the visual features of the radiology image with general knowledge and specific knowledge, the proposed model can improve the quality of generated reports. Experimental results on two publicly available datasets IU-Xray and MIMIC-CXR show that the proposed knowledge enhanced approach outperforms state-of-the-art image captioning based methods. Ablation studies also demonstrate that both general and specific knowledge can help to improve the performance of radiology report generation.
翻译:自动放射报告生成在诊所至关重要,这些诊所可以减轻有经验的放射学家的繁重工作量,提醒经验丰富的放射学家诊断错误或误诊; 现有方法主要将放射报告生成作为一种图像说明任务,并采用编码解码框架; 然而,在医疗领域,这种纯数据驱动方法存在下列问题:(1) 视觉和文字偏差问题;(2) 缺乏专家知识;在本文件中,我们提议采用知识强化放射报告生成方法,引入两类医疗知识:(1) 普通知识,这是独立投入,为报告生成提供广泛知识;(2) 具体知识,是投入依赖,为报告生成提供精细知识;为充分利用普通和特定知识,我们还提议一个知识强化多头关注机制;通过将放射图像的视觉特征与一般知识和具体知识相结合,拟议模式可以提高生成报告的质量; 两个公开提供的数据集的实验结果,是独立投入,为报告的生成提供了广泛的知识;(2) 具体知识,是投入依赖,为生成报告提供精细的。 为了充分利用一般知识和特定知识,我们还提议一个知识强化多头关注机制; 通过将放射图像图像图像生成的视觉特征与一般知识相结合,拟议的模型可以改进特定图像研究。