Training a neural network with a large labeled dataset is still a dominant paradigm in computational histopathology. However, obtaining such exhaustive manual annotations is often expensive, laborious, and prone to inter and Intra-observer variability. While recent self-supervised and semi-supervised methods can alleviate this need by learn-ing unsupervised feature representations, they still struggle to generalize well to downstream tasks when the number of labeled instances is small. In this work, we overcome this challenge by leveraging both task-agnostic and task-specific unlabeled data based on two novel strategies: i) a self-supervised pretext task that harnesses the underlying multi-resolution contextual cues in histology whole-slide images to learn a powerful supervisory signal for unsupervised representation learning; ii) a new teacher-student semi-supervised consistency paradigm that learns to effectively transfer the pretrained representations to downstream tasks based on prediction consistency with the task-specific un-labeled data. We carry out extensive validation experiments on three histopathology benchmark datasets across two classification and one regression-based tasks, i.e., tumor metastasis detection, tissue type classification, and tumor cellularity quantification. Under limited-label data, the proposed method yields tangible improvements, which is close or even outperforming other state-of-the-art self-supervised and supervised baselines. Furthermore, we empirically show that the idea of bootstrapping the self-supervised pretrained features is an effective way to improve the task-specific semi-supervised learning on standard benchmarks. Code and pretrained models will be made available at: https://github.com/srinidhiPY/SSL_CR_Histo
翻译:具有大量标签的数据集的神经网络培训仍然是计算生理病理学中的主要模式。然而,获得此类详尽的手动说明往往费用昂贵、艰苦、容易发生内部和内部观察者的变异性。虽然最近的自监管和半监管方法可以通过学习不受监督的特征演示来缓解这一需求,但在标签实例数量少时,它们仍然难以向下游任务全面推广。在这项工作中,我们通过利用基于以下两个新战略的P-SS型和特定任务无标签数据克服了这一挑战:i)自监管的托辞任务,利用其基础的多分辨率背景提示在组织整体和内部观察者的图像中学习一个强大的监督信号,以便进行不受监督的演示;ii)新的教师-学生半监管一致性模式,在标签实例数量少时,它们学会将预先测试的表述有效地转移到下游任务。我们根据以下两个新的战略,即i) 自我监督的自我监督的借口任务基准模型的自我测试,甚至通过两个系统内部分析的自我评估的自我测试和一个系统化的自我分析工具,在测试中,在测试中,在测试和精确的排序中,在排序中,在排序中,在排序中,在排序中,在排序和排序中,在排序中,在排序中,在排序中,在排序中,在排序中,在排序中,在排序和排序中,在排序中,在排序中,在排序中,我们。