Due to the limited and even imbalanced data, semi-supervised semantic segmentation tends to have poor performance on some certain categories, e.g., tailed categories in Cityscapes dataset which exhibits a long-tailed label distribution. Existing approaches almost all neglect this problem, and treat categories equally. Some popular approaches such as consistency regularization or pseudo-labeling may even harm the learning of under-performing categories, that the predictions or pseudo labels of these categories could be too inaccurate to guide the learning on the unlabeled data. In this paper, we look into this problem, and propose a novel framework for semi-supervised semantic segmentation, named adaptive equalization learning (AEL). AEL adaptively balances the training of well and badly performed categories, with a confidence bank to dynamically track category-wise performance during training. The confidence bank is leveraged as an indicator to tilt training towards under-performing categories, instantiated in three strategies: 1) adaptive Copy-Paste and CutMix data augmentation approaches which give more chance for under-performing categories to be copied or cut; 2) an adaptive data sampling approach to encourage pixels from under-performing category to be sampled; 3) a simple yet effective re-weighting method to alleviate the training noise raised by pseudo-labeling. Experimentally, AEL outperforms the state-of-the-art methods by a large margin on the Cityscapes and Pascal VOC benchmarks under various data partition protocols. Code is available at https://github.com/hzhupku/SemiSeg-AEL
翻译:由于数据有限,甚至不平衡,半监督的语义分解在某些类别上表现不佳,例如,市景数据集的尾细分类显示长期标签分布。现有办法几乎都忽视了这一问题,同等对待类别。一些流行办法,如一致性规范化或假标签,甚至可能损害对业绩不佳类别的学习,这些类别的预测或假标签可能太不准确,无法指导对未贴标签数据进行学习。在本文中,我们研究这一问题,并提议一个半监督的语义分解新框架,称为适应性均衡学习(AEL)。AEL适应性平衡了良好和业绩差的类别培训,而信心库可以动态地跟踪在培训期间的类别表现。信心库可以用作指标,将培训转向业绩不佳类别,三种战略即:1)适应性翻贴纸和CutMix数据放大基准,使业绩不佳类别有更多机会被复制或剪切;2 适应性数据取样法在A-LIA类下,通过简化数据取样法在A-ILx下,从简化数据递归为简化的BILx。