Purpose: In curriculum learning, the idea is to train on easier samples first and gradually increase the difficulty, while in self-paced learning, a pacing function defines the speed to adapt the training progress. While both methods heavily rely on the ability to score the difficulty of data samples, an optimal scoring function is still under exploration. Methodology: Distillation is a knowledge transfer approach where a teacher network guides a student network by feeding a sequence of random samples. We argue that guiding student networks with an efficient curriculum strategy can improve model generalization and robustness. For this purpose, we design an uncertainty-based paced curriculum learning in self distillation for medical image segmentation. We fuse the prediction uncertainty and annotation boundary uncertainty to develop a novel paced-curriculum distillation (PCD). We utilize the teacher model to obtain prediction uncertainty and spatially varying label smoothing with Gaussian kernel to generate segmentation boundary uncertainty from the annotation. We also investigate the robustness of our method by applying various types and severity of image perturbation and corruption. Results: The proposed technique is validated on two medical datasets of breast ultrasound image segmentation and robotassisted surgical scene segmentation and achieved significantly better performance in terms of segmentation and robustness. Conclusion: P-CD improves the performance and obtains better generalization and robustness over the dataset shift. While curriculum learning requires extensive tuning of hyper-parameters for pacing function, the level of performance improvement suppresses this limitation.
翻译:在课程学习中,我们的想法是首先训练比较容易的样本,然后逐渐增加难度,而在自定节奏的学习中,一个节奏功能决定了适应培训进展的速度。虽然两种方法都严重依赖对数据样品的难度进行分辨的能力,但最佳评分功能仍在探索中。方法:蒸馏是一种知识转让方法,教师网络通过输入随机抽样序列来引导学生网络。我们提出,指导学生网络采用有效的课程战略可以改进模式的概括性和稳健性。为此,我们设计了一个基于不确定性的节奏课程,学习医学图像分解的自我蒸馏过程。我们结合了预测不确定性和注释边界不确定性,以发展新的曲解样蒸馏速度(PCD)。我们利用教师模型来获得预测不确定性和空间差异性标签,以便通过输入随机样本来产生分解边界的不确定性。我们还通过应用各种类型和程度的图像分解和腐败来调查我们方法的稳健性。结果:拟议的技术在两次医学数据分解中得到了更稳健性的数据分层的精确性调整。我们利用了两种医学分层的精确性,从而改进了超硬性平的分层性分析。