Traditional semantic segmentation requires a large labeled image dataset and can only be predicted within predefined classes. To solve this problem, few-shot segmentation, which requires only a handful of annotations for the new target class, is important. However, with few-shot segmentation, the target class data distribution in the feature space is sparse and has low coverage because of the slight variations in the sample data. Setting the classification boundary that properly separates the target class from other classes is an impossible task. In particular, it is difficult to classify classes that are similar to the target class near the boundary. This study proposes the Interclass Prototype Relation Network (IPRNet), which improves the separation performance by reducing the similarity between other classes. We conducted extensive experiments with Pascal-5i and COCO-20i and showed that IPRNet provides the best segmentation performance compared with previous research.
翻译:传统的语义分解需要一个大标记的图像数据集,并且只能在预设的分类中预测。 要解决这个问题,只需对新目标类别略加说明即可进行微小的分解。然而,通过几发分解,地物空间的目标级数据分布很稀少,由于抽样数据略有差异,其覆盖范围较低。确定将目标类别与其他类别适当区分的分类界限是一项不可能的任务。特别是,很难对与边界附近目标类别相类似的类别进行分类。本研究建议采用跨类原型关系网络(IPRNet),通过减少其他类别之间的相似性来改进分离性能。我们与Pascal-5i和COCO-20i进行了广泛的实验,并表明PIPNet提供了与以往研究相比的最佳分解性表现。