零热塞义分解 (Decoupling Zero-Shot Semantic Segmentation) - 专知论文

会员服务 ·

0

Performer · SimPLe · GROUP · MoDELS · 语言模型化 ·

2021 年 12 月 15 日

Decoupling Zero-Shot Semantic Segmentation

翻译：零热塞义分解

Jian Ding,Nan Xue,Gui-Song Xia,Dengxin Dai

from arxiv, 14 pages, 8 figures

Zero-shot semantic segmentation (ZS3) aims to segment the novel categories that have not been seen in the training. Existing works formulate ZS3 as a pixel-level zero-shot classification problem, and transfer semantic knowledge from seen classes to unseen ones with the help of language models pre-trained only with texts. While simple, the pixel-level ZS3 formulation shows the limited capability to integrate vision-language models that are often pre-trained with image-text pairs and currently demonstrate great potential for vision tasks. Inspired by the observation that humans often perform segment-level semantic labeling, we propose to decouple the ZS3 into two sub-tasks: 1) a class-agnostic grouping task to group the pixels into segments. 2) a zero-shot classification task on segments. The former sub-task does not involve category information and can be directly transferred to group pixels for unseen classes. The latter subtask performs at segment-level and provides a natural way to leverage large-scale vision-language models pre-trained with image-text pairs (e.g. CLIP) for ZS3. Based on the decoupling formulation, we propose a simple and effective zero-shot semantic segmentation model, called ZegFormer, which outperforms the previous methods on ZS3 standard benchmarks by large margins, e.g., 35 points on the PASCAL VOC and 3 points on the COCO-Stuff in terms of mIoU for unseen classes. Code will be released at https://github.com/dingjiansw101/ZegFormer.

翻译：零点语义分解( ZS3 ) 旨在分割在培训中未看到的新分类。现有的工程将 ZS3 编成像素级零点分类问题, 并在语言模型的帮助下将语义学知识从可见类转换到看不见类, 只对文本进行预培训。简单的是, 像素级 ZS3 的配方表明, 整合愿景语言模型的能力有限, 这些模型通常经过图像文本配对的预先培训, 目前展示了巨大的视觉任务潜力。受人类经常执行部分级语义标签的观察的启发, 我们建议将 ZS3 调成两个子任务 :1) 类语义组, 将像素组分组, 仅对文本进行预培训。 2 片段的零点分类任务。以前的像类信息不包含类信息, 并且可以直接传输到不可见类类的类类像素。后一个子级塔克在分级上进行演练, 并且提供了一种自然的方式, 利用大型视觉模型模型进行分级的分级。 SS3 级 Sag- fre- frecial- sucial suder- seal sailding the degradustration, 在Silding the supdustrual- suplate- suplations

0

相关内容

Performer

【NeuraIPS2021】HSVA:面向零样本学习的分层语义视觉自适应

专知会员服务

13+阅读 · 2021年10月1日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【google】监督对比学习，Supervised Contrastive Learning

【google】监督对比学习，Supervised Contrastive Learning

专知会员服务

32+阅读 · 2020年4月23日

【CVPR2020-微软-CMU】视频物体分割的一种直推方法，Video Object Segmentation

【CVPR2020-微软-CMU】视频物体分割的一种直推方法，Video Object Segmentation

专知会员服务

7+阅读 · 2020年4月16日

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

专知会员服务

76+阅读 · 2020年4月10日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【CVPR2020】从未标记的视频中学习视频对象分割，Learning Video Object Segmentation from Unlabeled Videos

【CVPR2020】从未标记的视频中学习视频对象分割，Learning Video Object Segmentation from Unlabeled Videos

专知会员服务

36+阅读 · 2020年3月12日

【图像分割| 2019最新综述】自然图像和医学图像的深层语义分割，附21页PDF（Deep Semantic Segmentation of Natural and Medical Images: A Review）

【图像分割| 2019最新综述】自然图像和医学图像的深层语义分割，附21页PDF（Deep Semantic Segmentation of Natural and Medical Images: A Review）

专知会员服务

54+阅读 · 2019年11月16日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

已删除

无人机

3+阅读 · 2019年3月4日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【简评】[CVPR2017]Loss Max-Pooling for Semantic Image Segmentation

【简评】[CVPR2017]Loss Max-Pooling for Semantic Image Segmentation

极市平台

5+阅读 · 2017年6月15日

Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning

Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning

Arxiv

10+阅读 · 2021年10月11日

Leveraging Auxiliary Tasks with Affinity Learning for Weakly Supervised Semantic Segmentation

Arxiv

3+阅读 · 2021年7月27日

Zero-Shot Instance Segmentation

Arxiv

8+阅读 · 2021年6月1日

Unsupervised Domain Adaptation for Semantic Segmentation by Content Transfer

Arxiv

4+阅读 · 2020年12月23日

Learning Dynamic Routing for Semantic Segmentation

Learning Dynamic Routing for Semantic Segmentation

Arxiv

8+阅读 · 2020年3月23日

Object-Contextual Representations for Semantic Segmentation

Object-Contextual Representations for Semantic Segmentation

Arxiv

7+阅读 · 2019年11月19日

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

Arxiv

4+阅读 · 2019年7月4日

Integrating Semantic Knowledge to Tackle Zero-shot Text Classification

Arxiv

6+阅读 · 2019年3月29日

Learning Instance Segmentation by Interaction

Arxiv

6+阅读 · 2018年6月21日

Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection

Arxiv

9+阅读 · 2018年3月13日

VIP会员

文章信息

相关主题

语言模型化

相关VIP内容

【NeuraIPS2021】HSVA:面向零样本学习的分层语义视觉自适应

专知会员服务

13+阅读 · 2021年10月1日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【google】监督对比学习，Supervised Contrastive Learning

【google】监督对比学习，Supervised Contrastive Learning

专知会员服务

32+阅读 · 2020年4月23日

【CVPR2020-微软-CMU】视频物体分割的一种直推方法，Video Object Segmentation

【CVPR2020-微软-CMU】视频物体分割的一种直推方法，Video Object Segmentation

专知会员服务

7+阅读 · 2020年4月16日

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

专知会员服务

76+阅读 · 2020年4月10日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【CVPR2020】从未标记的视频中学习视频对象分割，Learning Video Object Segmentation from Unlabeled Videos

【CVPR2020】从未标记的视频中学习视频对象分割，Learning Video Object Segmentation from Unlabeled Videos

专知会员服务

36+阅读 · 2020年3月12日

【图像分割| 2019最新综述】自然图像和医学图像的深层语义分割，附21页PDF（Deep Semantic Segmentation of Natural and Medical Images: A Review）

【图像分割| 2019最新综述】自然图像和医学图像的深层语义分割，附21页PDF（Deep Semantic Segmentation of Natural and Medical Images: A Review）

专知会员服务

54+阅读 · 2019年11月16日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《美国海军陆战队软件定义网络应用案例：分布式防火墙自动化系统》148页

《多体环境下定位导航授时（PNT）系统研究》228页

软件定义无线电（SDR）：商业与军事领域的技术、应用及未来趋势

《攻势防空作战中无人追击者/规避者最优轨迹研究（含动态交战区建模）》95页

相关资讯

已删除

无人机

3+阅读 · 2019年3月4日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【简评】[CVPR2017]Loss Max-Pooling for Semantic Image Segmentation

【简评】[CVPR2017]Loss Max-Pooling for Semantic Image Segmentation

极市平台

5+阅读 · 2017年6月15日

相关论文

Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning

Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning

Arxiv

10+阅读 · 2021年10月11日

Leveraging Auxiliary Tasks with Affinity Learning for Weakly Supervised Semantic Segmentation

Arxiv

3+阅读 · 2021年7月27日

Zero-Shot Instance Segmentation

Arxiv

8+阅读 · 2021年6月1日

Unsupervised Domain Adaptation for Semantic Segmentation by Content Transfer

Arxiv

4+阅读 · 2020年12月23日

Learning Dynamic Routing for Semantic Segmentation

Learning Dynamic Routing for Semantic Segmentation

Arxiv

8+阅读 · 2020年3月23日

Object-Contextual Representations for Semantic Segmentation

Object-Contextual Representations for Semantic Segmentation

Arxiv

7+阅读 · 2019年11月19日

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

Arxiv

4+阅读 · 2019年7月4日

Integrating Semantic Knowledge to Tackle Zero-shot Text Classification

Arxiv

6+阅读 · 2019年3月29日

Learning Instance Segmentation by Interaction

Arxiv

6+阅读 · 2018年6月21日

Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection

Arxiv

9+阅读 · 2018年3月13日

微信扫码咨询专知VIP会员