Building a highly accurate predictive model for these tasks usually requires a large number of manually annotated labels and pixel regions (bounding boxes) of abnormalities. However, it is expensive to acquire such annotations, especially the bounding boxes. Recently, contrastive learning has shown strong promise in leveraging unlabeled natural images to produce highly generalizable and discriminative features. However, extending its power to the medical image domain is under-explored and highly non-trivial, since medical images are much less amendable to data augmentations. In contrast, their prior knowledge, as well as radiomic features, is often crucial. To bridge this gap, we propose an end-to-end semi-supervised knowledge-augmented contrastive learning framework, that simultaneously performs disease classification and localization tasks. The key knob of our framework is a unique positive sampling approach tailored for the medical images, by seamlessly integrating radiomic features as a knowledge augmentation. Specifically, we first apply an image encoder to classify the chest X-rays and to generate the image features. We next leverage Grad-CAM to highlight the crucial (abnormal) regions for chest X-rays (even when unannotated), from which we extract radiomic features. The radiomic features are then passed through another dedicated encoder to act as the positive sample for the image features generated from the same chest X-ray. In this way, our framework constitutes a feedback loop for image and radiomic modality features to mutually reinforce each other. Their contrasting yields knowledge-augmented representations that are both robust and interpretable. Extensive experiments on the NIH Chest X-ray dataset demonstrate that our approach outperforms existing baselines in both classification and localization tasks.
翻译:为这些任务建立高度准确的预测模型通常需要大量手工加注标签和异常的像素区域(边框),但获取这样的说明费用昂贵,特别是边框。最近,对比式学习在利用未贴标签的自然图像制作高度笼统和具有歧视性的特征方面显示了巨大的希望。然而,将自身能力扩大到医疗图像领域,探索不足,而且高度非三角,因为医疗图像对数据放大的修改要少得多。相比之下,它们先前的知识以及放射特征往往至关重要。为了缩小这一差距,我们建议采用一个端到端半端的半监督性知识对比学习框架,同时进行疾病分类和本地化任务。我们框架的关键 Knob 是专门为医疗图像定制的独特的积极取样方法,通过将放射学特性作为知识增强。具体地说,我们首先应用一个图像编码来对胸前X光光显示和生成图像特征。我们接下来从淡光-CAM 来强调核心(正常) 和红外红外红外的图像模型,然后通过红心仪显示另一个红外的图像区域。