Chest X-Ray imaging is one of the most common radiological tools for detection of various pathologies related to the chest area and lung function. In a clinical setting, automated assessment of chest radiographs has the potential of assisting physicians in their decision making process and optimize clinical workflows, for example by prioritizing emergency patients. Most work analyzing the potential of machine learning models to classify chest X-ray images focuses on vision methods processing and predicting pathologies for one image at a time. However, many patients undergo such a procedure multiple times during course of a treatment or during a single hospital stay. The patient history, that is previous images and especially the corresponding diagnosis contain useful information that can aid a classification system in its prediction. In this study, we analyze how information about diagnosis can improve CNN-based image classification models by constructing a novel dataset from the well studied CheXpert dataset of chest X-rays. We show that a model trained on additional patient history information outperforms a model trained without the information by a significant margin. We provide code to replicate the dataset creation and model training.
翻译:切斯特X射线成像是发现与胸部和肺功能有关的各种病理的最常见放射工具之一。在临床环境中,对胸部射线仪的自动评估有可能协助医生的决策过程,优化临床工作流程,例如对急诊病人进行优先排序。多数工作分析用于对胸部X射线图像进行分类的机器学习模型的潜力,侧重于对一个图像进行视觉方法处理和预测一次图像的病理。然而,许多病人在治疗过程中或一次住院期间多次接受这种程序。病人的历史,即以前的图像,特别是相应的诊断,包含有助于分类系统的预测的有用信息。在本研究中,我们分析有关诊断的信息如何通过从经过仔细研究的CheXpert胸部X光数据集中建立新的数据集来改进基于CNN的图像分类模型。我们显示,受过额外病人历史信息培训的模型比未经重要信息培训的模型要高得多。我们提供了复制数据集创建和模型培训的代码。