Semi-supervised semantic segmentation focuses on the exploration of a small amount of labeled data and a large amount of unlabeled data, which is more in line with the demands of real-world image understanding applications. However, it is still hindered by the inability to fully and effectively leverage unlabeled images. In this paper, we reveal that cross-window consistency (CWC) is helpful in comprehensively extracting auxiliary supervision from unlabeled data. Additionally, we propose a novel CWC-driven progressive learning framework to optimize the deep network by mining weak-to-strong constraints from massive unlabeled data. More specifically, this paper presents a biased cross-window consistency (BCC) loss with an importance factor, which helps the deep network explicitly constrain confidence maps from overlapping regions in different windows to maintain semantic consistency with larger contexts. In addition, we propose a dynamic pseudo-label memory bank (DPM) to provide high-consistency and high-reliability pseudo-labels to further optimize the network. Extensive experiments on three representative datasets of urban views, medical scenarios, and satellite scenes demonstrate our framework consistently outperforms the state-of-the-art methods with a large margin. Code will be available publicly.
翻译:半监督的语义分割法侧重于探索少量贴标签数据和大量未贴标签的数据,这更符合真实世界图像理解应用程序的要求,然而,由于无法充分和有效地利用未贴标签图像,仍然受到阻碍。在本文中,我们发现,跨窗口一致性有助于从无标签数据中全面提取辅助监督。此外,我们提议了一个由《化学武器公约》驱动的新颖的渐进学习框架,以优化深网络,从大量未贴标签数据中挖掘薄弱至强的制约因素。更具体地说,本文件展示了一个重要因素,有偏向的跨窗口一致性(BCC)损失,这有助于深网络明确限制不同窗口重叠区域的信任图,以保持与大环境的语义一致性。此外,我们提议建立一个动态的假标签存储库(DPM),以提供高度一致和高可靠性的伪标签,进一步优化网络。关于具有代表性的城市观点、医疗情景和卫星场景的三个数据集的广泛实验,将展示我们框架的架构将持续超越现有的大比例。