LSTM 与Encoder-Decoder LSTM 用于视频预报的多尺度网络 (Predictive Coding Based Multiscale Network with Encoder-Decoder LSTM for Video Prediction) - 专知论文

会员服务 ·

0

INFORMS · 长短期记忆网络 · Networking · 可约的 · 自下而上 ·

2022 年 12 月 22 日

Predictive Coding Based Multiscale Network with Encoder-Decoder LSTM for Video Prediction

翻译：LSTM 与Encoder-Decoder LSTM 用于视频预报的多尺度网络

Chaofan Ling,Junpei Zhong,Weihua Li

We are introducing a multi-scale predictive model for video prediction here, whose design is inspired by the "Predictive Coding" theories and "Coarse to Fine" approach. As a predictive coding model, it is updated by a combination of bottom-up and top-down information flows, which is different from traditional bottom-up training style. Its advantage is to reduce the dependence on input information and improve its ability to predict and generate images. Importantly, we achieve with a multi-scale approach -- higher level neurons generate coarser predictions (lower resolution), while the lower level generate finer predictions (higher resolution). This is different from the traditional predictive coding framework in which higher level predict the activity of neurons in lower level. To improve the predictive ability, we integrate an encoder-decoder network in the LSTM architecture and share the final encoded high-level semantic information between different levels. Additionally, since the output of each network level is an RGB image, a smaller LSTM hidden state can be used to retain and update the only necessary hidden information, avoiding being mapped to an overly discrete and complex space. In this way, we can reduce the difficulty of prediction and the computational overhead. Finally, we further explore the training strategies, to address the instability in adversarial training and mismatch between training and testing in long-term prediction. Code is available at https://github.com/Ling-CF/MSPN.

翻译：我们在此推出一个多尺度的视频预测预测模型,其设计受“预先编码”理论和“粗到精”方法的启发。作为一种预测编码模型,它通过自下而上和自上而下的信息流动相结合加以更新,这与传统的自下而上培训风格不同。它的优点是减少对输入信息的依赖,提高预测和生成图像的能力。重要的是,我们通过一个多尺度的方法实现了一个更高的神经神经元产生粗略预测(低分辨率),而较低的神经元则产生更精细的预测(高分辨率)。这与传统的预测编码框架不同,在传统的预测编码框架中,高层次预测神经元的活动会达到更高的水平。为了提高预测能力,我们将编码编码解码网络纳入LSTM结构,并在不同级别之间分享最后编码的高层次的测算性测算信息。此外,由于每个网络级别的产出是RGB图像(低分辨率),因此,一个较小的LSTM隐藏状态可以用来保留和更新必要的隐藏信息(高分辨率分辨率分辨率分辨率)。我们可以在长期的预测和复杂空间的测试中进一步减少我们进行不连续的、不连续和不连续的预测性分析的测试。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

74+阅读 · 2020年8月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【推荐】图像分类必读开创性论文汇总

【推荐】图像分类必读开创性论文汇总

机器学习研究会

14+阅读 · 2017年8月15日

手性磷酸和过渡金属共催化的串联反应研究：高对映选择性地合成杂环化合物

国家自然科学基金

1+阅读 · 2015年12月31日

聚偏氟乙烯嵌段共聚物的溶液结晶驱动自组装研究

国家自然科学基金

0+阅读 · 2015年12月31日

电沉积制备结构和性能可控的纳米结构硅合金膜层的研究

国家自然科学基金

0+阅读 · 2014年12月31日

均聚物自组装的研究

国家自然科学基金

0+阅读 · 2014年12月31日

电化学聚合薄膜的结构与功能调控及光电器件应用研究

国家自然科学基金

0+阅读 · 2013年12月31日

含生物可降解嵌段的氢键键合两亲性超分子共聚物的胶束化与溶液自组装

国家自然科学基金

0+阅读 · 2012年12月31日

层状过渡金属硫属化合物的合成、晶体结构和超导电性研究

国家自然科学基金

0+阅读 · 2012年12月31日

锌基三层核壳结构纳米复合材料对DNA分子的选择性识别与检测

国家自然科学基金

0+阅读 · 2011年12月31日

含有Me-Nx（M=Fe、Co、Mo，X=1-4）结构单元氧还原电催化剂的研究

国家自然科学基金

0+阅读 · 2009年12月31日

生物成因铁氧化物矿物矿化模型研究

国家自然科学基金

0+阅读 · 2008年12月31日

Attention-based CNN-LSTM and XGBoost hybrid model for stock prediction

Arxiv

0+阅读 · 2023年2月22日

Multiscale Sampling for the Inverse Modeling of Partial Differential Equations

Arxiv

0+阅读 · 2023年2月22日

Bokeh Rendering Based on Adaptive Depth Calibration Network

Bokeh Rendering Based on Adaptive Depth Calibration Network

Arxiv

0+阅读 · 2023年2月21日

Non-pooling Network for medical image segmentation

Arxiv

0+阅读 · 2023年2月21日

Ontology-aware Network for Zero-shot Sketch-based Image Retrieval

Arxiv

0+阅读 · 2023年2月20日

Interactive Face Video Coding: A Generative Compression Framework

Arxiv

0+阅读 · 2023年2月20日

Deep Neural Network Based Relation Extraction: An Overview

Arxiv

14+阅读 · 2021年1月6日

Spatio-Temporal Graph for Video Captioning with Knowledge Distillation

Spatio-Temporal Graph for Video Captioning with Knowledge Distillation

Arxiv

19+阅读 · 2020年3月31日

Linkage Based Face Clustering via Graph Convolution Network

Arxiv

16+阅读 · 2019年3月27日

Order-Free RNN with Visual Attention for Multi-Label Classification

Arxiv

16+阅读 · 2017年12月20日

VIP会员

文章信息

相关主题

长短期记忆网络

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

74+阅读 · 2020年8月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

人工智能治理的未来

模态感知的特征匹配：单一模态与跨模态技术的全面综述

无监督行人重识别研究综述

【牛津博士论文】面向神经影像应用的可扩展且可解释的空间模型

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

【推荐】图像分类必读开创性论文汇总

【推荐】图像分类必读开创性论文汇总

机器学习研究会

14+阅读 · 2017年8月15日

相关论文

Attention-based CNN-LSTM and XGBoost hybrid model for stock prediction

Arxiv

0+阅读 · 2023年2月22日

Multiscale Sampling for the Inverse Modeling of Partial Differential Equations

Arxiv

0+阅读 · 2023年2月22日

Bokeh Rendering Based on Adaptive Depth Calibration Network

Bokeh Rendering Based on Adaptive Depth Calibration Network

Arxiv

0+阅读 · 2023年2月21日

Non-pooling Network for medical image segmentation

Arxiv

0+阅读 · 2023年2月21日

Ontology-aware Network for Zero-shot Sketch-based Image Retrieval

Arxiv

0+阅读 · 2023年2月20日

Interactive Face Video Coding: A Generative Compression Framework

Arxiv

0+阅读 · 2023年2月20日

Deep Neural Network Based Relation Extraction: An Overview

Arxiv

14+阅读 · 2021年1月6日

Spatio-Temporal Graph for Video Captioning with Knowledge Distillation

Spatio-Temporal Graph for Video Captioning with Knowledge Distillation

Arxiv

19+阅读 · 2020年3月31日

Linkage Based Face Clustering via Graph Convolution Network

Arxiv

16+阅读 · 2019年3月27日

Order-Free RNN with Visual Attention for Multi-Label Classification

Arxiv

16+阅读 · 2017年12月20日

相关基金

手性磷酸和过渡金属共催化的串联反应研究：高对映选择性地合成杂环化合物

国家自然科学基金

1+阅读 · 2015年12月31日

聚偏氟乙烯嵌段共聚物的溶液结晶驱动自组装研究

国家自然科学基金

0+阅读 · 2015年12月31日

电沉积制备结构和性能可控的纳米结构硅合金膜层的研究

国家自然科学基金

0+阅读 · 2014年12月31日

均聚物自组装的研究

国家自然科学基金

0+阅读 · 2014年12月31日

电化学聚合薄膜的结构与功能调控及光电器件应用研究

国家自然科学基金

0+阅读 · 2013年12月31日

含生物可降解嵌段的氢键键合两亲性超分子共聚物的胶束化与溶液自组装

国家自然科学基金

0+阅读 · 2012年12月31日

层状过渡金属硫属化合物的合成、晶体结构和超导电性研究

国家自然科学基金

0+阅读 · 2012年12月31日

锌基三层核壳结构纳米复合材料对DNA分子的选择性识别与检测

国家自然科学基金

0+阅读 · 2011年12月31日

含有Me-Nx（M=Fe、Co、Mo，X=1-4）结构单元氧还原电催化剂的研究

国家自然科学基金

0+阅读 · 2009年12月31日

生物成因铁氧化物矿物矿化模型研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员