以心理物理为导向的相容图预测模型 (An Psychophysical Oriented Saliency Map Prediction Model) - 专知论文

会员服务 ·

0

显著图 · MoDELS · INFORMS · 可约的 · Performance ·

2021 年 5 月 20 日

An Psychophysical Oriented Saliency Map Prediction Model

翻译：以心理物理为导向的相容图预测模型

Visual attention is one of the most significant characteristics for selecting and understanding the outside redundancy world. The nature of complex scenes includes enormous redundancy. The human vision system can not process all information simultaneously because of visual information bottleneck. The human visual system mainly focuses on dominant parts of the scenes to reduce the input visual redundancy information. It is commonly known as visual attention prediction or visual saliency map. This paper proposes a new psychophysical saliency prediction architecture, WECSF, inspired by human low-level visual cortex function. The model consists of opponent color channels, wavelet transform, wavelet energy map, and contrast sensitivity function for extracting low-level image features and maximum approximation to the human visual system. The proposed model is evaluated several datasets, including MIT1003, MIT300, TORONTO, SID4VAM and UCF Sports dataset to explain its efficiency. We also quantitatively and qualitatively compared the performance of saliency prediction with other state-of-the-art models. Our model achieved very stable and good performance. Second, we also confirmed that Fourier and spectral-inspired saliency prediction models achieved outperformance compared to other start-of-the-art non-neural networks and even deep neural network models on psychophysical synthesis images. Finally, the proposed model also can be applied to spatial-temporal saliency prediction and got better performance.

翻译：视觉关注是选择和理解外部冗余世界的最重要特征之一。复杂场景的性质包括巨大的冗余。人类视觉系统不能同时处理所有信息, 因为视觉信息瓶颈。人类视觉系统主要侧重于场景的主要部分, 以减少输入的视觉冗余信息。它通常被称为视觉关注预测或视觉显著地图。本文提出了一个新的心理物理显著预测结构, 即WECSF, 受人类低水平视觉皮质功能的启发。模型由对手颜色频道、波盘变换、波盘能量映射和对比感应功能组成, 用于提取低级别图像特征和人类视觉系统的最大近似值的对比感应功能。所拟议的模型被评估的数据集包括MIT1003、 MIT300、 TORONTO、 SID4VAM 和 UCFC 体育数据集, 以解释其效率。我们还从量和质上将显性预测与其他状态的视觉皮质模型进行比较。我们的模型取得了非常稳定和良好的性能。其次, 我们还确认, 4级和光谱显著的显微预测模型比其他开始的网络和非空间图像化模型还被应用。

0

相关内容

显著图

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

最新《神经架构搜索NAS》教程，33页pdf

最新《神经架构搜索NAS》教程，33页pdf

专知会员服务

27+阅读 · 2020年12月2日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

专知会员服务

136+阅读 · 2020年3月8日

【干货】深度学习视觉跟踪:论文最新综述，23页pdf，Deep Learning for Visual Tracking: A Comprehensive Survey

【干货】深度学习视觉跟踪:论文最新综述，23页pdf，Deep Learning for Visual Tracking: A Comprehensive Survey

专知会员服务

57+阅读 · 2019年12月2日

【ECML-PKDD 2019】基于邻域增强LSTM模型的出租车乘客需求预测（A Neighborhood-augmented LSTM Model for Taxi-Passenger Demand Prediction）

【ECML-PKDD 2019】基于邻域增强LSTM模型的出租车乘客需求预测（A Neighborhood-augmented LSTM Model for Taxi-Passenger Demand Prediction）

专知会员服务

21+阅读 · 2019年12月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

已删除

将门创投

5+阅读 · 2020年3月2日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

Interflow: Aggregating Multi-layer Feature Mappings with Attention Mechanism

Arxiv

0+阅读 · 2021年7月13日

DDCNet-Multires: Effective Receptive Field Guided Multiresolution CNN for Dense Prediction

Arxiv

0+阅读 · 2021年7月12日

Align Deep Features for Oriented Object Detection

Arxiv

0+阅读 · 2021年7月12日

Dialogue State Tracking with Multi-Level Fusion of Predicted Dialogue States and Conversations

Arxiv

0+阅读 · 2021年7月12日

QoS Prediction for 5G Connected and Automated Driving

Arxiv

0+阅读 · 2021年7月11日

Predicting Risk-adjusted Returns using an Asset Independent Regime-switching Model

Arxiv

0+阅读 · 2021年7月7日

End-to-end Lane Shape Prediction with Transformers

Arxiv

3+阅读 · 2020年11月28日

Learning Discriminative Model Prediction for Tracking

Learning Discriminative Model Prediction for Tracking

Arxiv

6+阅读 · 2019年4月15日

Integrated Object Detection and Tracking with Tracklet-Conditioned Detection

Arxiv

3+阅读 · 2018年11月27日

Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention

Arxiv

7+阅读 · 2018年5月21日

VIP会员

文章信息

相关主题

相关VIP内容

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

最新《神经架构搜索NAS》教程，33页pdf

最新《神经架构搜索NAS》教程，33页pdf

专知会员服务

27+阅读 · 2020年12月2日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

专知会员服务

136+阅读 · 2020年3月8日

【干货】深度学习视觉跟踪:论文最新综述，23页pdf，Deep Learning for Visual Tracking: A Comprehensive Survey

【干货】深度学习视觉跟踪:论文最新综述，23页pdf，Deep Learning for Visual Tracking: A Comprehensive Survey

专知会员服务

57+阅读 · 2019年12月2日

【ECML-PKDD 2019】基于邻域增强LSTM模型的出租车乘客需求预测（A Neighborhood-augmented LSTM Model for Taxi-Passenger Demand Prediction）

【ECML-PKDD 2019】基于邻域增强LSTM模型的出租车乘客需求预测（A Neighborhood-augmented LSTM Model for Taxi-Passenger Demand Prediction）

专知会员服务

21+阅读 · 2019年12月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

AI行业专题报告：国产Agent不断演进，通用协议推进系统性应用

《基于MCP的软件设计模式视角下的大型语言模型智能体通信研究综述》

【ICML2025】大语言模型是自我示范预选择器

【斯坦福博士论文】可扩展、高效且安全的机器学习数据系统

相关资讯

已删除

将门创投

5+阅读 · 2020年3月2日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

相关论文

Interflow: Aggregating Multi-layer Feature Mappings with Attention Mechanism

Arxiv

0+阅读 · 2021年7月13日

DDCNet-Multires: Effective Receptive Field Guided Multiresolution CNN for Dense Prediction

Arxiv

0+阅读 · 2021年7月12日

Align Deep Features for Oriented Object Detection

Arxiv

0+阅读 · 2021年7月12日

Dialogue State Tracking with Multi-Level Fusion of Predicted Dialogue States and Conversations

Arxiv

0+阅读 · 2021年7月12日

QoS Prediction for 5G Connected and Automated Driving

Arxiv

0+阅读 · 2021年7月11日

Predicting Risk-adjusted Returns using an Asset Independent Regime-switching Model

Arxiv

0+阅读 · 2021年7月7日

End-to-end Lane Shape Prediction with Transformers

Arxiv

3+阅读 · 2020年11月28日

Learning Discriminative Model Prediction for Tracking

Learning Discriminative Model Prediction for Tracking

Arxiv

6+阅读 · 2019年4月15日

Integrated Object Detection and Tracking with Tracklet-Conditioned Detection

Arxiv

3+阅读 · 2018年11月27日

Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention

Arxiv

7+阅读 · 2018年5月21日

微信扫码咨询专知VIP会员