ISP4ML:了解图像信号处理在高效深学习愿景系统中的作用 (ISP4ML: Understanding the Role of Image Signal Processing in Efficient Deep Learning Vision Systems) - 专知论文

会员服务 ·

0

模型评估 · Signal Processing · Vision · Processing（编程语言） · 可理解性 ·

2021 年 3 月 17 日

ISP4ML: Understanding the Role of Image Signal Processing in Efficient Deep Learning Vision Systems

翻译：ISP4ML:了解图像信号处理在高效深学习愿景系统中的作用

Patrick Hansen,Alexey Vilkin,Yury Khrustalev,James Imber,David Hanwell,Matthew Mattina,Paul N. Whatmough

from arxiv, 13 pages, 11 figures

Convolutional neural networks (CNNs) are now predominant components in a variety of computer vision (CV) systems. These systems typically include an image signal processor (ISP), even though the ISP is traditionally designed to produce images that look appealing to humans. In CV systems, it is not clear what the role of the ISP is, or if it is even required at all for accurate prediction. In this work, we investigate the efficacy of the ISP in CNN classification tasks, and outline the system-level trade-offs between prediction accuracy and computational cost. To do so, we build software models of a configurable ISP and an imaging sensor in order to train CNNs on ImageNet with a range of different ISP settings and functionality. Results on ImageNet show that an ISP improves accuracy by 4.6%-12.2% on MobileNet architectures of different widths. Results using ResNets demonstrate that these trends also generalize to deeper networks. An ablation study of the various processing stages in a typical ISP reveals that the tone mapper is the most significant stage when operating on high dynamic range (HDR) images, by providing 5.8% average accuracy improvement alone. Overall, the ISP benefits system efficiency because the memory and computational costs of the ISP is minimal compared to the cost of using a larger CNN to achieve the same accuracy.

翻译：在各种计算机视觉系统(CV)中,这些系统通常包括一个图像信号处理器(ISP)的软件模型和成像传感器,以在图像网络上培训有线电视新闻网,其设置和功能各不相同。在CV系统中,不清楚ISP的作用是什么,或者是否甚至需要它来准确预测。在这项工作中,我们调查ISP在CNN分类任务中的功效,并概述系统一级预测准确性和计算成本之间的取舍。为此,我们建造了一个可配置ISP和成像传感器的软件模型,以便用各种ISP设置和功能在图像网络上培训CNNs。在图像网络上的结果显示,ISP在不同宽度的移动网络结构中提高了4.6%至12.2%的准确度。使用ResNet的结果显示,这些趋势也普遍到更深的网络。对典型ISP的各个处理阶段进行对比研究表明,在高动态范围操作时,音调制图仪是最重要的阶段(HDR),用不同的ISP设置和更大的图像进行图像的精确度,因为对ISP的精确度进行了比较,只有5.8%的精确度改进,因为I的精确度是I的完整的精确度,只有I的精确度才能进行整个的计算。

0

相关内容

模型评估

机器学习系统设计系统评估标准

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

325+阅读 · 2020年11月26日

【DeepMind】强化学习教程，83页ppt

【DeepMind】强化学习教程，83页ppt

专知会员服务

158+阅读 · 2020年8月7日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

74+阅读 · 2020年8月2日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【2020新书】深度学习视觉系统，Deep Learning for Vision Systems, 396页pdf

【2020新书】深度学习视觉系统，Deep Learning for Vision Systems, 396页pdf

专知会员服务

171+阅读 · 2020年2月18日

【MLA 2019】学习因果关系与因果关系学习（Learning Causality and Learning with Causality: A Road to Intelligence）美国卡内基梅隆大学，张坤

【MLA 2019】学习因果关系与因果关系学习（Learning Causality and Learning with Causality: A Road to Intelligence）美国卡内基梅隆大学，张坤

专知会员服务

126+阅读 · 2019年11月16日

【2019 北京智源大会】Deep Learning, NLP, and Bioinformatics 李明/ 滑铁卢大学计算机科学系教授

【2019 北京智源大会】Deep Learning, NLP, and Bioinformatics 李明/ 滑铁卢大学计算机科学系教授

专知会员服务

10+阅读 · 2019年11月1日

Understanding Color and the In-Camera Image Processing Pipeline for Computer Vision 【Michael S. Brown IEEE】韩国 ICCV 2019

Understanding Color and the In-Camera Image Processing Pipeline for Computer Vision 【Michael S. Brown IEEE】韩国 ICCV 2019

专知会员服务

10+阅读 · 2019年10月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

深度卷积神经网络中的降采样

深度卷积神经网络中的降采样

极市平台

12+阅读 · 2019年5月24日

无人机视觉挑战赛 | ICCV 2019 Workshop—VisDrone2019

无人机视觉挑战赛 | ICCV 2019 Workshop—VisDrone2019

PaperWeekly

7+阅读 · 2019年5月5日

计算机 | CCF推荐期刊专刊信息5条

计算机 | CCF推荐期刊专刊信息5条

Call4Papers

3+阅读 · 2019年4月10日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

计算机类 | 期刊专刊截稿信息9条

计算机类 | 期刊专刊截稿信息9条

Call4Papers

4+阅读 · 2018年1月26日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

基于深度学习的医疗影像论文汇总（Deep Learning Papers on Medical Image Analysis）

基于深度学习的医疗影像论文汇总（Deep Learning Papers on Medical Image Analysis）

AI研习社

17+阅读 · 2017年10月21日

【计算机类】期刊专刊/国际会议截稿信息6条

【计算机类】期刊专刊/国际会议截稿信息6条

Call4Papers

3+阅读 · 2017年10月13日

【推荐】图像分类必读开创性论文汇总

【推荐】图像分类必读开创性论文汇总

机器学习研究会

14+阅读 · 2017年8月15日

Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms

Arxiv

0+阅读 · 2021年5月11日

Quick NAT: High performance NAT system on commodity platforms

Arxiv

0+阅读 · 2021年5月9日

Facial Emotion Recognition: State of the Art Performance on FER2013

Arxiv

0+阅读 · 2021年5月8日

Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models

Arxiv

0+阅读 · 2021年5月7日

On the Time Series Length for an Accurate Fractal Analysis in Network Systems

Arxiv

0+阅读 · 2021年5月6日

A Survey on Active Learning and Human-in-the-Loop Deep Learning for Medical Image Analysis

Arxiv

1+阅读 · 2021年5月5日

Learning in the Frequency Domain

Learning in the Frequency Domain

Arxiv

11+阅读 · 2020年3月12日

Image Segmentation Using Deep Learning: A Survey

Image Segmentation Using Deep Learning: A Survey

Arxiv

47+阅读 · 2020年1月15日

Learning Cross-Modal Deep Embeddings for Multi-Object Image Retrieval using Text and Sketch

Arxiv

5+阅读 · 2018年4月28日

Convolutional Sequence to Sequence Learning

Arxiv

4+阅读 · 2017年7月25日

VIP会员

文章信息

相关主题

Signal Processing

Processing（编程语言）

相关VIP内容

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

325+阅读 · 2020年11月26日

【DeepMind】强化学习教程，83页ppt

【DeepMind】强化学习教程，83页ppt

专知会员服务

158+阅读 · 2020年8月7日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

74+阅读 · 2020年8月2日

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

【医学图像处理中的因果性】52页ppt，Causality Matters in Medical Imaging

专知会员服务

60+阅读 · 2020年3月14日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【2020新书】深度学习视觉系统，Deep Learning for Vision Systems, 396页pdf

【2020新书】深度学习视觉系统，Deep Learning for Vision Systems, 396页pdf

专知会员服务

171+阅读 · 2020年2月18日

【MLA 2019】学习因果关系与因果关系学习（Learning Causality and Learning with Causality: A Road to Intelligence）美国卡内基梅隆大学，张坤

【MLA 2019】学习因果关系与因果关系学习（Learning Causality and Learning with Causality: A Road to Intelligence）美国卡内基梅隆大学，张坤

专知会员服务

126+阅读 · 2019年11月16日

【2019 北京智源大会】Deep Learning, NLP, and Bioinformatics 李明/ 滑铁卢大学计算机科学系教授

【2019 北京智源大会】Deep Learning, NLP, and Bioinformatics 李明/ 滑铁卢大学计算机科学系教授

专知会员服务

10+阅读 · 2019年11月1日

Understanding Color and the In-Camera Image Processing Pipeline for Computer Vision 【Michael S. Brown IEEE】韩国 ICCV 2019

Understanding Color and the In-Camera Image Processing Pipeline for Computer Vision 【Michael S. Brown IEEE】韩国 ICCV 2019

专知会员服务

10+阅读 · 2019年10月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

视觉-语言-动作模型解析：从模块构成到里程碑与挑战

《解析陆域作战方向：一个概念性框架》报告

【博士论文】基于多模态基础模型的上下文学习

追寻真正的AI自主性：从遗留思维到战场优势

相关资讯

深度卷积神经网络中的降采样

深度卷积神经网络中的降采样

极市平台

12+阅读 · 2019年5月24日

无人机视觉挑战赛 | ICCV 2019 Workshop—VisDrone2019

无人机视觉挑战赛 | ICCV 2019 Workshop—VisDrone2019

PaperWeekly

7+阅读 · 2019年5月5日

计算机 | CCF推荐期刊专刊信息5条

计算机 | CCF推荐期刊专刊信息5条

Call4Papers

3+阅读 · 2019年4月10日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

计算机类 | 期刊专刊截稿信息9条

计算机类 | 期刊专刊截稿信息9条

Call4Papers

4+阅读 · 2018年1月26日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

基于深度学习的医疗影像论文汇总（Deep Learning Papers on Medical Image Analysis）

基于深度学习的医疗影像论文汇总（Deep Learning Papers on Medical Image Analysis）

AI研习社

17+阅读 · 2017年10月21日

【计算机类】期刊专刊/国际会议截稿信息6条

【计算机类】期刊专刊/国际会议截稿信息6条

Call4Papers

3+阅读 · 2017年10月13日

【推荐】图像分类必读开创性论文汇总

【推荐】图像分类必读开创性论文汇总

机器学习研究会

14+阅读 · 2017年8月15日

相关论文

Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms

Arxiv

0+阅读 · 2021年5月11日

Quick NAT: High performance NAT system on commodity platforms

Arxiv

0+阅读 · 2021年5月9日

Facial Emotion Recognition: State of the Art Performance on FER2013

Arxiv

0+阅读 · 2021年5月8日

Subsentence Extraction from Text Using Coverage-Based Deep Learning Language Models

Arxiv

0+阅读 · 2021年5月7日

On the Time Series Length for an Accurate Fractal Analysis in Network Systems

Arxiv

0+阅读 · 2021年5月6日

A Survey on Active Learning and Human-in-the-Loop Deep Learning for Medical Image Analysis

Arxiv

1+阅读 · 2021年5月5日

Learning in the Frequency Domain

Learning in the Frequency Domain

Arxiv

11+阅读 · 2020年3月12日

Image Segmentation Using Deep Learning: A Survey

Image Segmentation Using Deep Learning: A Survey

Arxiv

47+阅读 · 2020年1月15日

Learning Cross-Modal Deep Embeddings for Multi-Object Image Retrieval using Text and Sketch

Arxiv

5+阅读 · 2018年4月28日

Convolutional Sequence to Sequence Learning

Arxiv

4+阅读 · 2017年7月25日

微信扫码咨询专知VIP会员