Few-shot segmentation (FSS) aims to segment novel categories given scarce annotated support images. The crux of FSS is how to aggregate dense correlations between support and query images for query segmentation while being robust to the large variations in appearance and context. To this end, previous Transformer-based methods explore global consensus either on context similarity or affinity map between support-query pairs. In this work, we effectively integrate the context and affinity information via the proposed novel Context and Affinity Transformer (CATrans) in a hierarchical architecture. Specifically, the Relation-guided Context Transformer (RCT) propagates context information from support to query images conditioned on more informative support features. Based on the observation that a huge feature distinction between support and query pairs brings barriers for context knowledge transfer, the Relation-guided Affinity Transformer (RAT) measures attention-aware affinity as auxiliary information for FSS, in which the self-affinity is responsible for more reliable cross-affinity. We conduct experiments to demonstrate the effectiveness of the proposed model, outperforming the state-of-the-art methods.
翻译:微小截面图( FSS) 旨在分割小类小类中缺少附加注释的支持图像。 FSS 的柱石是,如何将支持和查询图像之间的浓密相关性汇总起来,以便查询截面图,同时对外观和背景的巨大差异保持稳健。 为此,以前以变异器为基础的方法探索了关于背景相似性或支持查询对配对之间的亲近性图的全球共识。 在这项工作中,我们通过拟议的小背景和亲近性变异器(CATrans),将背景和亲近性信息有效地整合到一个等级结构中。 具体地说, 关系引导背景变异器(RCT) 将背景信息从支持传播到查询图像,但以信息支持特性的特性特征特征特征特征特征更强,给背景知识转让带来障碍,因此, 关系引导亲近性变异器(RAT) 将关注和亲近性信息作为FSS的辅助信息, 其中自亲近性负责更可靠的交叉性。我们进行了实验,以示范拟议模型的有效性, 超越了状态方法。