Contrastive learning, which aims to capture general representation from unlabeled images to initialize the medical analysis models, has been proven effective in alleviating the high demand for expensive annotations. Current methods mainly focus on instance-wise comparisons to learn the global discriminative features, however, pretermitting the local details to distinguish tiny anatomical structures, lesions, and tissues. To address this challenge, in this paper, we propose a general unsupervised representation learning framework, named local discrimination (LD), to learn local discriminative features for medical images by closely embedding semantically similar pixels and identifying regions of similar structures across different images. Specifically, this model is equipped with an embedding module for pixel-wise embedding and a clustering module for generating segmentation. And these two modules are unified through optimizing our novel region discrimination loss function in a mutually beneficial mechanism, which enables our model to reflect structure information as well as measure pixel-wise and region-wise similarity. Furthermore, based on LD, we propose a center-sensitive one-shot landmark localization algorithm and a shape-guided cross-modality segmentation model to foster the generalizability of our model. When transferred to downstream tasks, the learned representation by our method shows a better generalization, outperforming representation from 18 state-of-the-art (SOTA) methods and winning 9 out of all 12 downstream tasks. Especially for the challenging lesion segmentation tasks, the proposed method achieves significantly better performances. The source codes are publicly available at https://github.com/HuaiChen-1994/LDLearning.
翻译:旨在从未贴标签的图像中获取总体代表性以启动医疗分析模型的对比性学习,已证明在缓解对昂贵说明的高需求方面行之有效。当前方法主要侧重于实例比较,以学习全球歧视特征,然而,预先确定本地细节,以区分细小解剖结构、损伤和组织。为了应对这一挑战,我们在本文件中提议了一个通用的未经监督的代表性学习框架,称为地方歧视(LD),以通过密切嵌入语义相似的像素和辨别不同图像中类似结构的区域来学习当地对医疗图像的区别性特征。具体地说,这一模式以实例比较的比较性比较性比较为主,为像素嵌嵌入和组组合模块学习全球差异化特点。这两个模块通过优化我们新的地区解析损失功能,在相互有益的机制中,通过我们的模型反映结构信息,以及测量像素顺流和区域相似性。此外,根据LD,我们提出了一种对中心敏感的、直观、直截线的本地化算法和形状导出不同图像的跨式结构结构。具体地,在生成的分解方法上,我们现有的分解的分解方法可以促进整个系统化的分解。