To enable a deep learning-based system to be used in the medical domain as a computer-aided diagnosis system, it is essential to not only classify diseases but also present the locations of the diseases. However, collecting instance-level annotations for various thoracic diseases is expensive. Therefore, weakly supervised localization methods have been proposed that use only image-level annotation. While the previous methods presented the disease location as the most discriminative part for classification, this causes a deep network to localize wrong areas for indistinguishable X-ray images. To solve this issue, we propose a spatial attention method using disease masks that describe the areas where diseases mainly occur. We then apply the spatial attention to find the precise disease area by highlighting the highest probability of disease occurrence. Meanwhile, the various sizes, rotations and noise in chest X-ray images make generating the disease masks challenging. To reduce the variation among images, we employ an alignment module to transform an input X-ray image into a generalized image. Through extensive experiments on the NIH-Chest X-ray dataset with eight kinds of diseases, we show that the proposed method results in superior localization performances compared to state-of-the-art methods.
翻译:为使深入的学习系统能够作为计算机辅助的诊断系统在医疗领域使用,不仅对疾病进行分类,而且对显示疾病地点也至关重要。然而,收集各种色素疾病的例级说明非常昂贵。因此,建议采用监督不力的本地化方法,只使用图像级注解。虽然以前的方法将疾病地点描述为分类中最具歧视性的部分,但造成一个深度网络,将错误的区域定位为无法分辨的X光图像。为了解决这个问题,我们建议使用一种空间关注方法,用疾病面具描述主要发生疾病的地区。然后我们运用空间关注,通过突出疾病发生概率来找到确切的疾病区。与此同时,胸部X光图像中的各种大小、旋转和噪音导致疾病口罩的挑战性。为减少图像之间的差异,我们使用一个校准模块将输入X光图像转换成一个通用图像。通过对八种疾病的NIH-Chest X光数据集进行广泛的实验,我们展示了拟议方法在与状态比较的本地化方法上取得优异性结果。