Locating diseases in chest X-ray images with few careful annotations saves large human effort. Recent works approached this task with innovative weakly-supervised algorithms such as multi-instance learning (MIL) and class activation maps (CAM), however, these methods often yield inaccurate or incomplete regions. One of the reasons is the neglection of the pathological implications hidden in the relationship across anatomical regions within each image and the relationship across images. In this paper, we argue that the cross-region and cross-image relationship, as contextual and compensating information, is vital to obtain more consistent and integral regions. To model the relationship, we propose the Graph Regularized Embedding Network (GREN), which leverages the intra-image and inter-image information to locate diseases on chest X-ray images. GREN uses a pre-trained U-Net to segment the lung lobes, and then models the intra-image relationship between the lung lobes using an intra-image graph to compare different regions. Meanwhile, the relationship between in-batch images is modeled by an inter-image graph to compare multiple images. This process mimics the training and decision-making process of a radiologist: comparing multiple regions and images for diagnosis. In order for the deep embedding layers of the neural network to retain structural information (important in the localization task), we use the Hash coding and Hamming distance to compute the graphs, which are used as regularizers to facilitate training. By means of this, our approach achieves the state-of-the-art result on NIH chest X-ray dataset for weakly-supervised disease localization. Our codes are accessible online (https://github.com/qibaolian/GREN).
翻译:将疾病在胸前X射线图像中定位而没有细微的描述可以节省大量人类努力。最近的工作与多因子学习(MIL)和课堂启动地图(CAM)等监管薄弱的创新算法接近了这项任务,但是,这些方法往往产生不准确或不完整的区域。其中一个原因是忽略了每个图像内各解剖区域的关系中隐藏的病理影响,以及图像之间的关系。在本文中,我们认为跨区域和跨图像关系,作为背景和补偿性信息,对于获得更加一致和一体化的区域至关重要。为了模拟这种关系,我们建议GREN(GREN),利用图中图像内部成像和图像间成像来定位胸部X射线图像。GREN使用预先训练的U-Net来分析肺部,然后用一个内部直径直线图来模拟不同区域的肺部叶部内部图像关系。同时,匹配图像之间的结果关系是用一个内部直径直径图来建模的模型,用一个内径直径直图来模拟。这个内建图用来比较我们内部的内径结构结构图,用来比较我们内部的内置结构结构图,用来比较我们内部的图像。这个内置的图,用来比较了内置结构结构结构结构图,用来用来用来比较我们内部的图层结构结构图。这个结构结构图解的图解。这个系统路段,用来用来用来比较我们的内置。这个系统,用来用来用来用来用来比较我们的内置。