MulT: 端到端多任务学习变换器 (MulT: An End-to-End Multitask Learning Transformer) - 专知论文

会员服务 ·

0

变换 · MoDELS · 学成 · 端到端 · Vision ·

2022 年 5 月 17 日

MulT: An End-to-End Multitask Learning Transformer

翻译：MulT: 端到端多任务学习变换器

Deblina Bhattacharjee,Tong Zhang,Sabine Süsstrunk,Mathieu Salzmann

from arxiv, Accepted to CVPR 2022

We propose an end-to-end Multitask Learning Transformer framework, named MulT, to simultaneously learn multiple high-level vision tasks, including depth estimation, semantic segmentation, reshading, surface normal estimation, 2D keypoint detection, and edge detection. Based on the Swin transformer model, our framework encodes the input image into a shared representation and makes predictions for each vision task using task-specific transformer-based decoder heads. At the heart of our approach is a shared attention mechanism modeling the dependencies across the tasks. We evaluate our model on several multitask benchmarks, showing that our MulT framework outperforms both the state-of-the art multitask convolutional neural network models and all the respective single task transformer models. Our experiments further highlight the benefits of sharing attention across all the tasks, and demonstrate that our MulT model is robust and generalizes well to new domains. Our project website is at https://ivrl.github.io/MulT/.

翻译：我们提议了一个名为MulT的端到端多任务学习变异器框架,以同时学习多个高层次的愿景任务,包括深度估计、语义分割、重新阴影、表面正常估计、2D关键点探测和边缘探测。基于Swin变异器模型,我们的框架将输入图像编码成一个共享的表达方式,并利用基于任务变异器的脱coder头目对每一项愿景任务作出预测。我们的方法的核心是建立一个共同关注机制,对各项任务之间的依赖性进行建模。我们用多个多任务基准来评估我们的模型,显示我们的MulT框架超越了艺术的多任务共进神经网络模型和所有相应的单一任务变异器模型。我们的实验进一步强调了在所有任务中共享关注的好处,并表明我们的MulT模型是强大的,并且对新的领域进行了广泛的普及。我们的项目网站是 https://ivrl.github.io/MulT/。

0

相关内容

【Tutorial】计算机视觉中的Transformer，98页ppt

【Tutorial】计算机视觉中的Transformer，98页ppt

专知会员服务

151+阅读 · 2021年10月25日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

321+阅读 · 2020年11月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

多任务学习(Multitask-Learning)相关资料、经典论文、开源代码整理分享

多任务学习(Multitask-Learning)相关资料、经典论文、开源代码整理分享

深度学习与NLP

45+阅读 · 2019年10月22日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

专知

19+阅读 · 2018年5月31日

【论文推荐】最新七篇自注意力机制(Self-attention)相关论文—结构化自注意力、相对位置、混合、句子表达、文本向量

【论文推荐】最新七篇自注意力机制(Self-attention)相关论文—结构化自注意力、相对位置、混合、句子表达、文本向量

专知

29+阅读 · 2018年3月12日

DMRTA1/RAGE调控肝脏胰岛素抵抗的分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

核磁共振研究抗病毒蛋白IFITM3的结构和抗病毒分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

绵马贯众间苯三酚类化合物黄绵马酸AB抑制A型流感病毒复制的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

特定lincRNA在体细胞重编程中的功能与机制

国家自然科学基金

0+阅读 · 2012年12月31日

拟南芥DIF（DRIP1-Interacting Factor）在胁迫信号应答中的功能分析

国家自然科学基金

0+阅读 · 2012年12月31日

家蚕细小病毒样病毒非结构蛋白NS1的表达调控及靶分子识别

国家自然科学基金

0+阅读 · 2012年12月31日

ZmRop1调控玉米抗甘蔗花叶病毒的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

乙型脑炎病毒激活小胶质细胞的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

Ndfip1蛋白抑制神经细胞凋亡的分子机制研究

国家自然科学基金

1+阅读 · 2010年12月31日

番茄花叶病毒CP与烟草Fd I直接互作的分子机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

ABAW: Learning from Synthetic Data & Multi-Task Learning Challenges

Arxiv

0+阅读 · 2022年7月5日

Multimodal Frame-Scoring Transformer for Video Summarization

Arxiv

0+阅读 · 2022年7月5日

CRFormer: A Cross-Region Transformer for Shadow Removal

Arxiv

0+阅读 · 2022年7月4日

I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference

Arxiv

0+阅读 · 2022年7月4日

Adaptive Multi-view and Temporal Fusing Transformer for 3D Human Pose Estimation

Arxiv

0+阅读 · 2022年7月4日

A Survey of Visual Transformers

Arxiv

39+阅读 · 2021年11月11日

Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

Arxiv

13+阅读 · 2021年4月7日

Transformer Tracking

Arxiv

17+阅读 · 2021年3月29日

Learning from History: Modeling Temporal Knowledge Graphs with Sequential Copy-Generation Networks

Arxiv

11+阅读 · 2020年12月15日

End-to-End Multi-Task Learning with Attention

Arxiv

19+阅读 · 2018年3月28日

VIP会员

文章信息

相关主题

相关VIP内容

【Tutorial】计算机视觉中的Transformer，98页ppt

【Tutorial】计算机视觉中的Transformer，98页ppt

专知会员服务

151+阅读 · 2021年10月25日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

321+阅读 · 2020年11月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】数据驱动决策中的激励、信息与不确定性

DGP双粒度提示框架：图增强大模型助力欺诈检测

【ICCV2025】ESSENTIAL：用于视频类增量学习的情景记忆与语义记忆整合

唯快不破：大型语言模型高效架构综述

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

多任务学习(Multitask-Learning)相关资料、经典论文、开源代码整理分享

多任务学习(Multitask-Learning)相关资料、经典论文、开源代码整理分享

深度学习与NLP

45+阅读 · 2019年10月22日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

专知

19+阅读 · 2018年5月31日

【论文推荐】最新七篇自注意力机制(Self-attention)相关论文—结构化自注意力、相对位置、混合、句子表达、文本向量

【论文推荐】最新七篇自注意力机制(Self-attention)相关论文—结构化自注意力、相对位置、混合、句子表达、文本向量

专知

29+阅读 · 2018年3月12日

相关论文

ABAW: Learning from Synthetic Data & Multi-Task Learning Challenges

Arxiv

0+阅读 · 2022年7月5日

Multimodal Frame-Scoring Transformer for Video Summarization

Arxiv

0+阅读 · 2022年7月5日

CRFormer: A Cross-Region Transformer for Shadow Removal

Arxiv

0+阅读 · 2022年7月4日

I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference

Arxiv

0+阅读 · 2022年7月4日

Adaptive Multi-view and Temporal Fusing Transformer for 3D Human Pose Estimation

Arxiv

0+阅读 · 2022年7月4日

A Survey of Visual Transformers

Arxiv

39+阅读 · 2021年11月11日

Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

Arxiv

13+阅读 · 2021年4月7日

Transformer Tracking

Arxiv

17+阅读 · 2021年3月29日

Learning from History: Modeling Temporal Knowledge Graphs with Sequential Copy-Generation Networks

Arxiv

11+阅读 · 2020年12月15日

End-to-End Multi-Task Learning with Attention

Arxiv

19+阅读 · 2018年3月28日

相关基金

DMRTA1/RAGE调控肝脏胰岛素抵抗的分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

核磁共振研究抗病毒蛋白IFITM3的结构和抗病毒分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

绵马贯众间苯三酚类化合物黄绵马酸AB抑制A型流感病毒复制的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

特定lincRNA在体细胞重编程中的功能与机制

国家自然科学基金

0+阅读 · 2012年12月31日

拟南芥DIF（DRIP1-Interacting Factor）在胁迫信号应答中的功能分析

国家自然科学基金

0+阅读 · 2012年12月31日

家蚕细小病毒样病毒非结构蛋白NS1的表达调控及靶分子识别

国家自然科学基金

0+阅读 · 2012年12月31日

ZmRop1调控玉米抗甘蔗花叶病毒的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

乙型脑炎病毒激活小胶质细胞的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

Ndfip1蛋白抑制神经细胞凋亡的分子机制研究

国家自然科学基金

1+阅读 · 2010年12月31日

番茄花叶病毒CP与烟草Fd I直接互作的分子机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员