Localization and characterization of diseases like pneumonia are primary steps in a clinical pipeline, facilitating detailed clinical diagnosis and subsequent treatment planning. Additionally, such location annotated datasets can provide a pathway for deep learning models to be used for downstream tasks. However, acquiring quality annotations is expensive on human resources and usually requires domain expertise. On the other hand, medical reports contain a plethora of information both about pneumonia characteristics and its location. In this paper, we propose a novel weakly-supervised attention-driven deep learning model that leverages encoded information in medical reports during training to facilitate better localization. Our model also performs classification of attributes that are associated to pneumonia and extracted from medical reports for supervision. Both the classification and localization are trained in conjunction and once trained, the model can be utilized for both the localization and characterization of pneumonia using only the input image. In this paper, we explore and analyze the model using chest X-ray datasets and demonstrate qualitatively and quantitatively that the introduction of textual information improves pneumonia localization. We showcase quantitative results on two datasets, MIMIC-CXR and Chest X-ray-8, and we also showcase severity characterization on the COVID-19 dataset.
翻译:肺炎等疾病的本地化和定性是临床管道中的首要步骤,有助于详细的临床诊断和随后的治疗规划。此外,这种位置的附加说明的数据集可以为用于下游任务的深层次学习模式提供一个途径。然而,获得高质量的说明对人力资源来说费用昂贵,通常需要领域专长。另一方面,医疗报告包含大量关于肺炎特征及其位置的信息。在本文件中,我们提议了一个新的、监督不力的、受关注驱动的深层次学习模式,在培训期间利用医学报告中的编码信息,促进更好的本地化。我们的模型还进行与肺炎有关的属性分类,并从医疗报告中提取供监督之用。分类和本地化都经过共同培训,一旦经过培训,该模型就可以用于肺炎本地化和定性,仅使用输入图像。在本文中,我们利用胸X光数据集对模型进行探讨和分析,并从质量和数量上表明采用文字信息可以改善肺炎本地化。我们展示了两个数据集的定量结果,MIMI-CXR和Chest X-ray-8,我们还展示了COVI的定量数据特征。