用变换器推动少发的语义分割 (Boosting Few-shot Semantic Segmentation with Transformers)

Due to the fact that fully supervised semantic segmentation methods require sufficient fully-labeled data to work well and can not generalize to unseen classes, few-shot segmentation has attracted lots of research attention. Previous arts extract features from support and query images, which are processed jointly before making predictions on query images. The whole process is based on convolutional neural networks (CNN), leading to the problem that only local information is used. In this paper, we propose a TRansformer-based Few-shot Semantic segmentation method (TRFS). Specifically, our model consists of two modules: Global Enhancement Module (GEM) and Local Enhancement Module (LEM). GEM adopts transformer blocks to exploit global information, while LEM utilizes conventional convolutions to exploit local information, across query and support features. Both GEM and LEM are complementary, helping to learn better feature representations for segmenting query images. Extensive experiments on PASCAL-5i and COCO datasets show that our approach achieves new state-of-the-art performance, demonstrating its effectiveness.

翻译：由于充分监督的语义分解方法需要足够的全标签数据才能很好地发挥作用,而且不能将数据推广到看不见的类别,少数截肢已经引起了许多研究关注。从支持和查询图像中提取的以往艺术特征,在对查询图像作出预测之前是共同处理的。整个过程都基于进化神经网络(CNN),导致只使用当地信息的问题。在本文中,我们建议采用基于TRansformex的少发分解方法(TRFS)。具体地说,我们的模型由两个模块组成:全球增强模块(GEM)和地方增强模块(LEM)。GEM采用变压器块来利用全球信息,而LEM则利用传统的组合来利用本地信息,跨越查询和支持功能。GEM和LEM都是互补的,有助于为分解查询图像学习更好的特征描述。关于PASAL-5i和COCO数据集的广泛实验表明,我们的方法取得了新的状态性能,显示了其有效性。

相关内容

小样本学习

关注 215

小样本学习（Few-Shot Learning，以下简称 FSL ）用于解决当可用的数据量比较少时，如何提升神经网络的性能。在 FSL 中，经常用到的一类方法被称为 Meta-learning。和普通的神经网络的训练方法一样，Meta-learning 也包含训练过程和测试过程，但是它的训练过程被称作 Meta-training 和 Meta-testing。

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

小目标检测技术研究综述

专知会员服务

123+阅读 · 2020年12月7日

最新《Transformers模型》教程，64页ppt

专知会员服务

321+阅读 · 2020年11月26日

【复旦大学邱锡鹏教授】自然语言处理中的自注意力模型，53页ppt

专知会员服务

129+阅读 · 2020年9月2日