The automatic clinical caption generation problem is referred to as proposed model combining the analysis of frontal chest X-Ray scans with structured patient information from the radiology records. We combine two language models, the Show-Attend-Tell and the GPT-3, to generate comprehensive and descriptive radiology records. The proposed combination of these models generates a textual summary with the essential information about pathologies found, their location, and the 2D heatmaps localizing each pathology on the original X-Ray scans. The proposed model is tested on two medical datasets, the Open-I, MIMIC-CXR, and the general-purpose MS-COCO. The results measured with the natural language assessment metrics prove their efficient applicability to the chest X-Ray image captioning.
翻译:自动临床标题生成问题被称为将前胸X射线扫描分析与放射记录中的结构性病人信息相结合的拟议模型,我们结合了两种语言模型,即Show-Attend Tell和GPT-3,以生成全面和描述性放射记录。这些模型的拟议组合生成了文字摘要,其中含有所发现的病理学、其位置和2D热图的基本信息,将每个病理学都定位在原X射线扫描上。在两个医疗数据集,即Open-I、MIMIMIC-CXR和通用MS-COCO,上测试了拟议模型。用自然语言评估指标衡量的结果证明了其对胸部X光图像说明的有效适用性。