人类注意预测增强变异模型 (Semantic Segmentation Enhanced Transformer Model for Human Attention Prediction) - 专知论文

会员服务 ·

0

Learning · Attention · state-of-the-art · 变换 · Transformer模型 ·

2023 年 1 月 26 日

Semantic Segmentation Enhanced Transformer Model for Human Attention Prediction

翻译：人类注意预测增强变异模型

Saliency Prediction aims to predict the attention distribution of human eyes given an RGB image. Most of the recent state-of-the-art methods are based on deep image feature representations from traditional CNNs. However, the traditional convolution could not capture the global features of the image well due to its small kernel size. Besides, the high-level factors which closely correlate to human visual perception, e.g., objects, color, light, etc., are not considered. Inspired by these, we propose a Transformer-based method with semantic segmentation as another learning objective. More global cues of the image could be captured by Transformer. In addition, simultaneously learning the object segmentation simulates the human visual perception, which we would verify in our investigation of human gaze control in cognitive science. We build an extra decoder for the subtask and the multiple tasks share the same Transformer encoder, forcing it to learn from multiple feature spaces. We find in practice simply adding the subtask might confuse the main task learning, hence Multi-task Attention Module is proposed to deal with the feature interaction between the multiple learning targets. Our method achieves competitive performance compared to other state-of-the-art methods.

翻译：以 RGB 图像显示的人类眼睛的注意分布。最新最先进的方法大多基于传统CNN 的深度图像特征演示。然而,传统变异由于内核大小小,无法捕捉图像的全局特征。此外,没有考虑到与人类视觉感知密切相关的高层次因素,例如物体、颜色、光等。受这些因素的启发,我们提议采用以变异器为基础的方法,将语义分化作为另一个学习目标。变异器可以捕捉更多全球图像线索。此外,同时学习天体分解模拟人类视觉感知,我们将在对认知科学中的人类凝视控制进行调查时加以核实。我们为子任务和多重任务建造了一个额外的解码器, 共享相同的变异变器编码器, 迫使它从多个特征空间学习。我们发现, 在实践中, 仅仅添加子塔斯克可能混淆主要任务学习过程, 因此多功能注意模块被提议处理多个学习目标之间的地貌互动。

0

相关内容

Learning

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Volterra积分微分方程的多区间Chebyshev和Legendre谱配置法

国家自然科学基金

0+阅读 · 2015年12月31日

基于LED自适应照明优化的可见光通信网多域耦合传输技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

miR-204对人胚胎干细胞源性视网膜色素上皮细胞紧密连接的调控机制

国家自然科学基金

0+阅读 · 2013年12月31日

稀土簇及稀土与过渡金属簇-有机骨架的构筑及性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于核酸适配体识别的肿瘤靶向自组装DNA纳米笼载药系统的研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于CERES-MAIZE模型降水保险指数研究-以北京夏玉米为例

国家自然科学基金

0+阅读 · 2013年12月31日

Fibulin-5/β1-integrin 信号通路在醛固酮诱导血管平滑肌细胞凋亡中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

富锂层状氧化物的结构调控与电化学性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

陶瓷材料界面高温服役性能研究和界面抗疲劳特征

国家自然科学基金

0+阅读 · 2011年12月31日

双向、长距离光纤混沌保密通信研究

国家自然科学基金

0+阅读 · 2009年12月31日

FateZero: Fusing Attentions for Zero-shot Text-based Video Editing

Arxiv

0+阅读 · 2023年3月17日

Predicting Human Attention using Computational Attention

Arxiv

0+阅读 · 2023年3月16日

KS-DETR: Knowledge Sharing in Attention Learning for Detection Transformer

Arxiv

0+阅读 · 2023年3月16日

AutoEnsemble: Automated Ensemble Search Framework for Semantic Segmentation Using Image Labels

Arxiv

0+阅读 · 2023年3月15日

DABERT: Dual Attention Enhanced BERT for Semantic Matching

Arxiv

0+阅读 · 2023年3月15日

Activation Modulation and Recalibration Scheme for Weakly Supervised Semantic Segmentation

Arxiv

12+阅读 · 2021年12月16日

Cross-Modal Self-Attention Network for Referring Image Segmentation

Cross-Modal Self-Attention Network for Referring Image Segmentation

Arxiv

18+阅读 · 2019年4月9日

Deep Representation Learning for Domain Adaptation of Semantic Image Segmentation

Arxiv

10+阅读 · 2018年5月10日

End-to-End Dense Video Captioning with Masked Transformer

Arxiv

14+阅读 · 2018年4月3日

End-to-End Multi-Task Learning with Attention

Arxiv

19+阅读 · 2018年3月28日

VIP会员

文章信息

相关主题

state-of-the-art

Transformer模型

相关VIP内容

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】数据驱动决策中的激励、信息与不确定性

DGP双粒度提示框架：图增强大模型助力欺诈检测

【ICCV2025】ESSENTIAL：用于视频类增量学习的情景记忆与语义记忆整合

唯快不破：大型语言模型高效架构综述

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

FateZero: Fusing Attentions for Zero-shot Text-based Video Editing

Arxiv

0+阅读 · 2023年3月17日

Predicting Human Attention using Computational Attention

Arxiv

0+阅读 · 2023年3月16日

KS-DETR: Knowledge Sharing in Attention Learning for Detection Transformer

Arxiv

0+阅读 · 2023年3月16日

AutoEnsemble: Automated Ensemble Search Framework for Semantic Segmentation Using Image Labels

Arxiv

0+阅读 · 2023年3月15日

DABERT: Dual Attention Enhanced BERT for Semantic Matching

Arxiv

0+阅读 · 2023年3月15日

Activation Modulation and Recalibration Scheme for Weakly Supervised Semantic Segmentation

Arxiv

12+阅读 · 2021年12月16日

Cross-Modal Self-Attention Network for Referring Image Segmentation

Cross-Modal Self-Attention Network for Referring Image Segmentation

Arxiv

18+阅读 · 2019年4月9日

Deep Representation Learning for Domain Adaptation of Semantic Image Segmentation

Arxiv

10+阅读 · 2018年5月10日

End-to-End Dense Video Captioning with Masked Transformer

Arxiv

14+阅读 · 2018年4月3日

End-to-End Multi-Task Learning with Attention

Arxiv

19+阅读 · 2018年3月28日

相关基金

Volterra积分微分方程的多区间Chebyshev和Legendre谱配置法

国家自然科学基金

0+阅读 · 2015年12月31日

基于LED自适应照明优化的可见光通信网多域耦合传输技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

miR-204对人胚胎干细胞源性视网膜色素上皮细胞紧密连接的调控机制

国家自然科学基金

0+阅读 · 2013年12月31日

稀土簇及稀土与过渡金属簇-有机骨架的构筑及性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于核酸适配体识别的肿瘤靶向自组装DNA纳米笼载药系统的研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于CERES-MAIZE模型降水保险指数研究-以北京夏玉米为例

国家自然科学基金

0+阅读 · 2013年12月31日

Fibulin-5/β1-integrin 信号通路在醛固酮诱导血管平滑肌细胞凋亡中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

富锂层状氧化物的结构调控与电化学性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

陶瓷材料界面高温服役性能研究和界面抗疲劳特征

国家自然科学基金

0+阅读 · 2011年12月31日

双向、长距离光纤混沌保密通信研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员