For medical image segmentation, contrastive learning is the dominant practice to improve the quality of visual representations by contrasting semantically similar and dissimilar pairs of samples. This is enabled by the observation that without accessing ground truth label, negative examples with truly dissimilar anatomical features, if sampled, can significantly improve the performance. In reality, however, these samples may come from similar anatomical features and the models may struggle to distinguish the minority tail-class samples, making the tail classes more prone to misclassification, both of which typically lead to model collapse. In this paper, we propose ARCO, a semi-supervised contrastive learning (CL) framework with stratified group sampling theory in medical image segmentation. In particular, we first propose building ARCO through the concept of variance-reduced estimation, and show that certain variance-reduction techniques are particularly beneficial in medical image segmentation tasks with extremely limited labels. Furthermore, we theoretically prove these sampling techniques are universal in variance reduction. Finally, we experimentally validate our approaches on three benchmark datasets with different label settings, and our methods consistently outperform state-of-the-art semi- and fully-supervised methods. Additionally, we augment the CL frameworks with these sampling techniques and demonstrate significant gains over previous methods. We believe our work is an important step towards semi-supervised medical image segmentation by quantifying the limitation of current self-supervision objectives for accomplishing medical image analysis tasks.
翻译:对于医疗图象分解而言,对比式学习是提高视觉表现质量的主要做法,通过对比结构相似和不同样本的对比体来提高视觉表现质量。这得益于以下观察:如果不使用地面真理标签,如果抽样,则具有真正不同解剖特征的负面例子,如果抽样,就能大大改善性能。然而,在现实中,这些样品可能来自类似的解剖特征,模型可能难以区分少数尾类样本,使尾类更容易被错误分类,这通常会导致模型崩溃。在本文中,我们提议采用半监督的对比学习框架,即医学图象分解中带有分级群体抽样理论的半监督对比学习框架。特别是,我们首先提议通过差异化估计概念来建立ARCO,并表明某些减少差异的技术特别有利于医疗图象分解任务,而标签极为有限。此外,我们从理论上证明这些抽样技术在减少差异方面是普遍的。最后,我们实验性地验证了我们在三个基准数据集上采用的不同标签设置的方法,而我们采用的方法在医学图象分级结构上持续超越了我们目前的自我分析方法,我们完全相信这些重要的前的半级分析方法。