Improving Anytime Prediction with Parallel Cascaded Networks and a Temporal-Difference Loss - 专知 (Zhuanzhi) Papers


Cascade · INFORMS · Feedforward · Networking · ResNet

June 11, 2021

Improving Anytime Prediction with Parallel Cascaded Networks and a Temporal-Difference Loss


Michael L. Iuzzolino, Michael C. Mozer, Samy Bengio

Although deep feedforward neural networks share some characteristics with the primate visual system, a key distinction is their dynamics. Deep nets typically operate in serial stages wherein each layer completes its computation before processing begins in subsequent layers. In contrast, biological systems have cascaded dynamics: information propagates from neurons at all layers in parallel but transmission occurs gradually over time, leading to speed-accuracy trade-offs even in feedforward architectures. We explore the consequences of biologically inspired parallel hardware by constructing cascaded ResNets in which each residual block has propagation delays but all blocks update in parallel in a stateful manner. Because information transmitted through skip connections avoids delays, the functional depth of the architecture increases over time, yielding anytime predictions that improve with internal-processing time. We introduce a temporal-difference training loss that achieves a strictly superior speed-accuracy profile over standard losses and enables the cascaded architecture to outperform state-of-the-art anytime-prediction methods. The cascaded architecture has intriguing properties, including: it classifies typical instances more rapidly than atypical instances; it is more robust to both persistent and transient noise than is a conventional ResNet; and its time-varying output trace provides a signal that can be exploited to improve information processing and inference.
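
The cascaded dynamics described in the abstract can be illustrated with a toy NumPy sketch. It is not the authors' implementation: it assumes a one-step delay on every residual branch, uses small dense `tanh` blocks in place of convolutional ResNet blocks, and all function names (`make_blocks`, `residual_branch`, `serial_forward`, `cascaded_forward`) are hypothetical. Under these assumptions, the skip path is instantaneous, every branch is updated in parallel from the state of the previous step, and the output at step t reflects a functional depth of at most t blocks.

```python
import numpy as np


def make_blocks(num_blocks, dim, seed=0):
    """Random weight matrices standing in for residual blocks (toy assumption)."""
    rng = np.random.default_rng(seed)
    return [rng.normal(scale=0.5, size=(dim, dim)) for _ in range(num_blocks)]


def residual_branch(weight, x):
    """Residual function of one block; a dense layer with tanh for illustration."""
    return np.tanh(weight @ x)


def serial_forward(weights, x):
    """Conventional ResNet pass: each block finishes before the next begins."""
    h = x
    for w in weights:
        h = h + residual_branch(w, h)
    return h


def cascaded_forward(weights, x, num_steps):
    """Cascaded pass: branch outputs arrive one time step late, skips do not.

    Returns the network output at every time step (the anytime trace).
    """
    # Delayed branch outputs; nothing has propagated yet at t = 0.
    branch_out = [np.zeros_like(x) for _ in weights]
    outputs = []
    for _ in range(num_steps):
        # Skip connections carry information instantly, so the input of each
        # block is the network input plus all earlier delayed branch outputs.
        inputs = []
        h = x
        for delayed in branch_out:
            inputs.append(h)
            h = h + delayed
        outputs.append(h)
        # Parallel, stateful update: every branch reads its current input,
        # and the result becomes visible only at the next time step.
        branch_out = [residual_branch(w, inp) for w, inp in zip(weights, inputs)]
    return outputs


weights = make_blocks(num_blocks=4, dim=8)
x = np.random.default_rng(1).normal(size=8)
trace = cascaded_forward(weights, x, num_steps=len(weights) + 1)
reference = serial_forward(weights, x)
```

In this sketch the first entry of `trace` is just the input passed through the skip path, and once as many steps have elapsed as there are blocks the cascaded output matches `reference`. Intermediate entries are the anytime outputs that a readout layer could classify early.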


