多模态语义分割中缺失模态稳健性的半监督研究 (Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation) - 专知论文

会员服务 ·

0

模态 · 稳健 · 多模 · 稳健性 · 半监督 ·

2023 年 4 月 21 日

Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation

翻译：多模态语义分割中缺失模态稳健性的半监督研究

Harsh Maheshwari,Yen-Cheng Liu,Zsolt Kira

Using multiple spatial modalities has been proven helpful in improving semantic segmentation performance. However, there are several real-world challenges that have yet to be addressed: (a) improving label efficiency and (b) enhancing robustness in realistic scenarios where modalities are missing at the test time. To address these challenges, we first propose a simple yet efficient multi-modal fusion mechanism Linear Fusion, that performs better than the state-of-the-art multi-modal models even with limited supervision. Second, we propose M3L: Multi-modal Teacher for Masked Modality Learning, a semi-supervised framework that not only improves the multi-modal performance but also makes the model robust to the realistic missing modality scenario using unlabeled data. We create the first benchmark for semi-supervised multi-modal semantic segmentation and also report the robustness to missing modalities. Our proposal shows an absolute improvement of up to 10% on robust mIoU above the most competitive baselines. Our code is available at https://github.com/harshm121/M3L

翻译：使用多个空间模态已被证明有助于提高语义分割性能。然而，尚有几个现实世界中需要解决的挑战：（a）提高标签效率和（b）增强在测试时模态缺失的现实场景下的稳健性。为了解决这些问题，我们首先提出了一种简单而有效的多模态融合机制线性融合，即使受到有限监督，其表现也比最先进的多模态模型更好。其次，我们提出了M3L：用于屏蔽性模态学习的多模态教师，这是一种半监督框架，不仅提高了多模态性能，而且使用未标记的数据使模型在现实世界中的缺失模态场景中具有稳健性。我们创建了半监督多模态语义分割的第一个基准，并报告了对缺失模态的鲁棒性。我们的提议在最具竞争力的基线上比稳健mIoU至少提高了10％。我们的代码可在 https://github.com/harshm121/M3L 上获得。

0

相关内容

【CVPR2023】带缺失模态多模态提示的视觉识别

【CVPR2023】带缺失模态多模态提示的视觉识别

专知会员服务

23+阅读 · 2023年3月10日

【CVPR 2022】基于实例深度估计的统一深度感知全景分割 PanopticDepth: Per-Instance Depth Estimation for Unified Depth-Aware Panoptic Segmentation

【CVPR 2022】基于实例深度估计的统一深度感知全景分割 PanopticDepth: Per-Instance Depth Estimation for Unified Depth-Aware Panoptic Segmentation

专知会员服务

18+阅读 · 2022年3月19日

【Hugging Face】使用自定义数据集微调语义分割模型，Fine-Tune a Semantic Segmentation Model with a Custom Dataset

【Hugging Face】使用自定义数据集微调语义分割模型，Fine-Tune a Semantic Segmentation Model with a Custom Dataset

专知会员服务

21+阅读 · 2022年3月18日

【CVPR 2022】单黑箱和多黑箱预测的领域适应，DINE: Domain Adaptation from Single and Multiple Black-box Predictors

【CVPR 2022】单黑箱和多黑箱预测的领域适应，DINE: Domain Adaptation from Single and Multiple Black-box Predictors

专知会员服务

14+阅读 · 2022年3月12日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

专知会员服务

76+阅读 · 2020年4月10日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【CVPR2020】从未标记的视频中学习视频对象分割，Learning Video Object Segmentation from Unlabeled Videos

【CVPR2020】从未标记的视频中学习视频对象分割，Learning Video Object Segmentation from Unlabeled Videos

专知会员服务

36+阅读 · 2020年3月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【ICDAR2019教程】计算机视觉中的文本形式，Vision and Language: the text modality in computer vision

【ICDAR2019教程】计算机视觉中的文本形式，Vision and Language: the text modality in computer vision

专知会员服务

25+阅读 · 2019年9月21日

CVPR2019| 05-17更新11篇论文及代码合集（含一篇oral，视觉跟踪/实例分割/行人重识别等）

CVPR2019| 05-17更新11篇论文及代码合集（含一篇oral，视觉跟踪/实例分割/行人重识别等）

极市平台

11+阅读 · 2019年5月17日

Github项目推荐 | 语义分割、实例分割、全景分割和视频分割的论文和基准列表

Github项目推荐 | 语义分割、实例分割、全景分割和视频分割的论文和基准列表

AI研习社

32+阅读 · 2019年4月5日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

泡泡机器人SLAM

11+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

《pyramid Attention Network for Semantic Segmentation》

《pyramid Attention Network for Semantic Segmentation》

统计学习与视觉计算组

44+阅读 · 2018年8月30日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【推荐】NiftyNet：面向医学图像分析和图像引导治疗的开源CNN平台（附代码）

【推荐】NiftyNet：面向医学图像分析和图像引导治疗的开源CNN平台（附代码）

机器学习研究会

12+阅读 · 2018年1月27日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【推荐】全卷积语义分割综述

【推荐】全卷积语义分割综述

机器学习研究会

19+阅读 · 2017年8月31日

稀土元素对FeGa合金性能影响机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向高效供电和多端相互支撑的交直流混联配电运行控制研究

国家自然科学基金

0+阅读 · 2013年12月31日

主题模型建模框架下的高分辨率遥感影像半监督分类研究

国家自然科学基金

0+阅读 · 2013年12月31日

切换线性系统的若干动力学性质研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于风险测度的供应链鲁棒建模与策略研究

国家自然科学基金

2+阅读 · 2012年12月31日

Nb元素对TiAl合金高温疲劳性能的影响

国家自然科学基金

0+阅读 · 2012年12月31日

半监督半配对高维多表示数据的降维及拓展研究

国家自然科学基金

0+阅读 · 2011年12月31日

Delta 5 Stat5a与乳腺癌: Delta 5 Stat5a的全基因组结合位点分析及其表观基因组学研究

国家自然科学基金

0+阅读 · 2011年12月31日

积分几何与凸几何分析

国家自然科学基金

2+阅读 · 2009年12月31日

缺失数据下部分线性单指标模型的经验似然推断

国家自然科学基金

0+阅读 · 2009年12月31日

Modality-Agnostic Learning for Medical Image Segmentation Using Multi-modality Self-distillation

Arxiv

0+阅读 · 2023年6月6日

DiffuseExpand: Expanding dataset for 2D medical image segmentation using diffusion models

Arxiv

0+阅读 · 2023年6月6日

Semantic Segmentation on VSPW Dataset through Contrastive Loss and Multi-dataset Training Approach

Arxiv

0+阅读 · 2023年6月6日

SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation

Arxiv

0+阅读 · 2023年6月6日

Contrastive learning of global and local features for medical image segmentation with limited annotations

Arxiv

19+阅读 · 2020年6月18日

RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds

Arxiv

11+阅读 · 2019年11月25日

nnU-Net: Self-adapting Framework for U-Net-Based Medical Image Segmentation

Arxiv

12+阅读 · 2018年9月27日

A 3D Coarse-to-Fine Framework for Volumetric Medical Image Segmentation

A 3D Coarse-to-Fine Framework for Volumetric Medical Image Segmentation

Arxiv

15+阅读 · 2018年8月2日

W-net: Bridged U-net for 2D Medical Image Segmentation

W-net: Bridged U-net for 2D Medical Image Segmentation

Arxiv

20+阅读 · 2018年7月12日

An application of cascaded 3D fully convolutional networks for medical image segmentation

Arxiv

10+阅读 · 2018年3月20日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR2023】带缺失模态多模态提示的视觉识别

【CVPR2023】带缺失模态多模态提示的视觉识别

专知会员服务

23+阅读 · 2023年3月10日

【CVPR 2022】基于实例深度估计的统一深度感知全景分割 PanopticDepth: Per-Instance Depth Estimation for Unified Depth-Aware Panoptic Segmentation

【CVPR 2022】基于实例深度估计的统一深度感知全景分割 PanopticDepth: Per-Instance Depth Estimation for Unified Depth-Aware Panoptic Segmentation

专知会员服务

18+阅读 · 2022年3月19日

【Hugging Face】使用自定义数据集微调语义分割模型，Fine-Tune a Semantic Segmentation Model with a Custom Dataset

【Hugging Face】使用自定义数据集微调语义分割模型，Fine-Tune a Semantic Segmentation Model with a Custom Dataset

专知会员服务

21+阅读 · 2022年3月18日

【CVPR 2022】单黑箱和多黑箱预测的领域适应，DINE: Domain Adaptation from Single and Multiple Black-box Predictors

【CVPR 2022】单黑箱和多黑箱预测的领域适应，DINE: Domain Adaptation from Single and Multiple Black-box Predictors

专知会员服务

14+阅读 · 2022年3月12日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

【CVPR2020-中科院计算所】弱监督语义分割的自监督等价注意力机制，Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

专知会员服务

76+阅读 · 2020年4月10日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【CVPR2020】从未标记的视频中学习视频对象分割，Learning Video Object Segmentation from Unlabeled Videos

【CVPR2020】从未标记的视频中学习视频对象分割，Learning Video Object Segmentation from Unlabeled Videos

专知会员服务

36+阅读 · 2020年3月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【ICDAR2019教程】计算机视觉中的文本形式，Vision and Language: the text modality in computer vision

【ICDAR2019教程】计算机视觉中的文本形式，Vision and Language: the text modality in computer vision

专知会员服务

25+阅读 · 2019年9月21日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

CVPR2019| 05-17更新11篇论文及代码合集（含一篇oral，视觉跟踪/实例分割/行人重识别等）

CVPR2019| 05-17更新11篇论文及代码合集（含一篇oral，视觉跟踪/实例分割/行人重识别等）

极市平台

11+阅读 · 2019年5月17日

Github项目推荐 | 语义分割、实例分割、全景分割和视频分割的论文和基准列表

Github项目推荐 | 语义分割、实例分割、全景分割和视频分割的论文和基准列表

AI研习社

32+阅读 · 2019年4月5日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

【泡泡一分钟】用于评估视觉惯性里程计的TUM VI数据集

泡泡机器人SLAM

11+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

《pyramid Attention Network for Semantic Segmentation》

《pyramid Attention Network for Semantic Segmentation》

统计学习与视觉计算组

44+阅读 · 2018年8月30日

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

【论文推荐】最新5篇图像分割（Image Segmentation）相关论文—多重假设、超像素分割、自监督、图、生成对抗网络

专知

27+阅读 · 2018年2月7日

【推荐】NiftyNet：面向医学图像分析和图像引导治疗的开源CNN平台（附代码）

【推荐】NiftyNet：面向医学图像分析和图像引导治疗的开源CNN平台（附代码）

机器学习研究会

12+阅读 · 2018年1月27日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

【推荐】全卷积语义分割综述

【推荐】全卷积语义分割综述

机器学习研究会

19+阅读 · 2017年8月31日

相关论文

Modality-Agnostic Learning for Medical Image Segmentation Using Multi-modality Self-distillation

Arxiv

0+阅读 · 2023年6月6日

DiffuseExpand: Expanding dataset for 2D medical image segmentation using diffusion models

Arxiv

0+阅读 · 2023年6月6日

Semantic Segmentation on VSPW Dataset through Contrastive Loss and Multi-dataset Training Approach

Arxiv

0+阅读 · 2023年6月6日

SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation

Arxiv

0+阅读 · 2023年6月6日

Contrastive learning of global and local features for medical image segmentation with limited annotations

Arxiv

19+阅读 · 2020年6月18日

RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds

Arxiv

11+阅读 · 2019年11月25日

nnU-Net: Self-adapting Framework for U-Net-Based Medical Image Segmentation

Arxiv

12+阅读 · 2018年9月27日

A 3D Coarse-to-Fine Framework for Volumetric Medical Image Segmentation

A 3D Coarse-to-Fine Framework for Volumetric Medical Image Segmentation

Arxiv

15+阅读 · 2018年8月2日

W-net: Bridged U-net for 2D Medical Image Segmentation

W-net: Bridged U-net for 2D Medical Image Segmentation

Arxiv

20+阅读 · 2018年7月12日

An application of cascaded 3D fully convolutional networks for medical image segmentation

Arxiv

10+阅读 · 2018年3月20日

相关基金

稀土元素对FeGa合金性能影响机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向高效供电和多端相互支撑的交直流混联配电运行控制研究

国家自然科学基金

0+阅读 · 2013年12月31日

主题模型建模框架下的高分辨率遥感影像半监督分类研究

国家自然科学基金

0+阅读 · 2013年12月31日

切换线性系统的若干动力学性质研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于风险测度的供应链鲁棒建模与策略研究

国家自然科学基金

2+阅读 · 2012年12月31日

Nb元素对TiAl合金高温疲劳性能的影响

国家自然科学基金

0+阅读 · 2012年12月31日

半监督半配对高维多表示数据的降维及拓展研究

国家自然科学基金

0+阅读 · 2011年12月31日

Delta 5 Stat5a与乳腺癌: Delta 5 Stat5a的全基因组结合位点分析及其表观基因组学研究

国家自然科学基金

0+阅读 · 2011年12月31日

积分几何与凸几何分析

国家自然科学基金

2+阅读 · 2009年12月31日

缺失数据下部分线性单指标模型的经验似然推断

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员