Training semantic segmentation models requires a large amount of finely annotated data, making it hard to quickly adapt to novel classes not satisfying this condition. Few-Shot Segmentation (FS-Seg) tackles this problem with many constraints. In this paper, we introduce a new benchmark, called Generalized Few-Shot Semantic Segmentation (GFS-Seg), to analyze the generalization ability of simultaneously segmenting the novel categories with very few examples and the base categories with sufficient examples. It is the first study showing that previous representative state-of-the-art FS-Seg methods fall short in GFS-Seg and the performance discrepancy mainly comes from the constrained setting of FS-Seg. To make GFS-Seg tractable, we set up a GFS-Seg baseline that achieves decent performance without structural change on the original model. Then, since context is essential for semantic segmentation, we propose the Context-Aware Prototype Learning (CAPL) that significantly improves performance by 1) leveraging the co-occurrence prior knowledge from support samples, and 2) dynamically enriching contextual information to the classifier, conditioned on the content of each query image. Both two contributions are experimentally shown to have substantial practical merit. Extensive experiments on Pascal-VOC and COCO manifest the effectiveness of CAPL, and CAPL generalizes well to FS-Seg by achieving competitive performance. Code is available at https://github.com/dvlab-research/GFS-Seg.
翻译:培训语义分解模型需要大量精细附加说明的数据,因此很难快速适应不符合此条件的新类。 少点点偏移( FS- Seg) 以许多限制来解决这个问题。 在本文中, 我们引入了一个新的基准, 称为“ 通用的少点偏移语分解( GFS- Seg) ” (GFS- Seg), 以分析同时分解小类的概括能力, 并举几个例子和基础类别。 这是第一项研究, 显示前代代表最先进的FS- Seg 方法在GFS- Seg 中落后, 而绩效差异主要来自FS- Seg 的制约设置。 为了让 GFS- Seg 调控缩, 我们设置了一个GFS-S- Seg 基线, 在原始模型没有结构变化的情况下实现体面的性能。 接着,我们提议以背景- Aware Prototy Learning (C) 来显著改进业绩, 利用支持样本的CO- clocal- Vealalal- exalalalalal reearrial sess exildalal exismal dealmentalmental 。