In this work, we propose a novel straightforward method for medical volume and sequence segmentation with limited annotations. To avert laborious annotating, the recent success of self-supervised learning(SSL) motivates the pre-training on unlabeled data. Despite its success, it is still challenging to adapt typical SSL methods to volume/sequence segmentation, due to their lack of mining on local semantic discrimination and rare exploitation on volume and sequence structures. Based on the continuity between slices/frames and the common spatial layout of organs across volumes/sequences, we introduced a novel bootstrap self-supervised representation learning method by leveraging the predictable possibility of neighboring slices. At the core of our method is a simple and straightforward dense self-supervision on the predictions of local representations and a strategy of predicting locals based on global context, which enables stable and reliable supervision for both global and local representation mining among volumes. Specifically, we first proposed an asymmetric network with an attention-guided predictor to enforce distance-specific prediction and supervision on slices within and across volumes/sequences. Secondly, we introduced a novel prototype-based foreground-background calibration module to enhance representation consistency. The two parts are trained jointly on labeled and unlabeled data. When evaluated on three benchmark datasets of medical volumes and sequences, our model outperforms existing methods with a large margin of 4.5\% DSC on ACDC, 1.7\% on Prostate, and 2.3\% on CAMUS. Intensive evaluations reveals the effectiveness and superiority of our method.
翻译:在这项工作中,我们提出了一个新的直接的方法,用于医疗量和序列的分解,并有有限的说明。为了避免困难的注释,最近自我监督学习的成功激励了未贴标签数据的培训前阶段。尽管取得了成功,但我们仍然难以将典型的SSL方法调整为数量/顺序分解,因为当地语义歧视没有采矿,数量和顺序结构的稀有利用。根据切片/框架和不同量/顺序器官共同空间布局之间的连续性,我们采用了一种新的自我监督代表制学习方法,利用可预见到的相邻切片的可能性。我们的方法的核心是简单和直截了当的自我监督本地代表制预测和基于全球背景预测/顺序的预测当地代表制战略。具体地说,我们首先提出了一个不对称的网络模式,以关注制导的频率预测和跨卷/序列对切片进行远程预测和监督。第二,我们用经过培训的CLisloral标准格式的大规模模型和CLisloral标准,我们用最新版本的Caldalbal 格式对目前版本数据进行了联合的版本进行校准。