《从未来学习:关于语区分割的新自培训框架》 (Learning from Future: A Novel Self-Training Framework for Semantic Segmentation)

Self-training has shown great potential in semi-supervised learning. Its core idea is to use the model learned on labeled data to generate pseudo-labels for unlabeled samples, and in turn teach itself. To obtain valid supervision, active attempts typically employ a momentum teacher for pseudo-label prediction yet observe the confirmation bias issue, where the incorrect predictions may provide wrong supervision signals and get accumulated in the training process. The primary cause of such a drawback is that the prevailing self-training framework acts as guiding the current state with previous knowledge, because the teacher is updated with the past student only. To alleviate this problem, we propose a novel self-training strategy, which allows the model to learn from the future. Concretely, at each training step, we first virtually optimize the student (i.e., caching the gradients without applying them to the model weights), then update the teacher with the virtual future student, and finally ask the teacher to produce pseudo-labels for the current student as the guidance. In this way, we manage to improve the quality of pseudo-labels and thus boost the performance. We also develop two variants of our future-self-training (FST) framework through peeping at the future both deeply (FST-D) and widely (FST-W). Taking the tasks of unsupervised domain adaptive semantic segmentation and semi-supervised semantic segmentation as the instances, we experimentally demonstrate the effectiveness and superiority of our approach under a wide range of settings. Code will be made publicly available.

翻译：自我培训在半监督的学习中表现出了巨大的潜力。其核心理念是使用在标签数据上学习的模型为未贴标签的样本生成假标签,然后进行自我教育。为了获得有效的监督, 积极的尝试通常会使用假标签预测的势头教师, 同时又观察确认偏差问题, 错误的预测可能会提供错误的监督信号, 并在培训过程中积累。这种缺陷的主要原因是, 流行的自培训框架会以先前的知识指导当前的状况, 因为教师只是与过去的学生一起更新。为了缓解这一问题, 我们提出了一个全新的自我培训战略, 让模型能够从未来学习。具体地说, 在每次培训阶段, 我们首先几乎优化学生( 将梯度涂在不应用模型重量的情况下), 然后用虚拟的未来学生向教师更新假标签, 最后让教师为当前学生制作假标签作为指导。这样, 我们设法提高假标签的质量, 从而提升业绩。我们还在将来的SOV-S- Stial化框架中, (ST- Streal-Serial Stal-deal-trading the seal- Seal- delvial- lavial- lavial- laview- Seal- laview- laview- laview-Si-Sial- laview-Sial-Sial-Sial-Stial- Stal- laview-S-S-Stial-SIT-SIT-SIT-SIT-Stial-Stial-Stial-Stial-Stitionaltradingdal- lading-Stiction-Stiction-Stiction-SIT-Stiction-SIT-SIT-SIT-SIT-SIT-SIT-SIT-SIT-SIT-Stial-Stial-tra-tra-S-S-S-S-S-S-S-STI-STIal-Stitra-SIT-SIT-S-SIT-SIT-SIT-SIT-S-S-S-S-S-S-S-S-S-S-S-SIT-S-SIT-SIT-S-S-S-S-S-S