世界模型中的隔离和杠杆利用可控和不可控视觉动态 (Isolating and Leveraging Controllable and Noncontrollable Visual Dynamics in World Models) - 专知论文

会员服务 ·

0

控制器 · MoDELS · 学成 · 优化器 · INTERACT ·

2022 年 5 月 27 日

Isolating and Leveraging Controllable and Noncontrollable Visual Dynamics in World Models

翻译：世界模型中的隔离和杠杆利用可控和不可控视觉动态

Minting Pan,Xiangming Zhu,Yunbo Wang,Xiaokang Yang

World models learn the consequences of actions in vision-based interactive systems. However, in practical scenarios such as autonomous driving, there commonly exists noncontrollable dynamics independent of the action signals, making it difficult to learn effective world models. To tackle this problem, we present a novel reinforcement learning approach named Iso-Dream, which improves the Dream-to-Control framework in two aspects. First, by optimizing the inverse dynamics, we encourage the world model to learn controllable and noncontrollable sources of spatiotemporal changes on isolated state transition branches. Second, we optimize the behavior of the agent on the decoupled latent imaginations of the world model. Specifically, to estimate state values, we roll-out the noncontrollable states into the future and associate them with the current controllable state. In this way, the isolation of dynamics sources can greatly benefit long-horizon decision-making of the agent, such as a self-driving car that can avoid potential risks by anticipating the movement of other vehicles. Experiments show that Iso-Dream is effective in decoupling the mixed dynamics and remarkably outperforms existing approaches in a wide range of visual control and prediction domains.

翻译：世界模型在基于愿景的互动系统中学习行动的后果。然而,在诸如自主驱动等实际情景中,通常存在与行动信号无关的无法控制的动态,因此难以学习有效的世界模型。为了解决这一问题,我们提出了名为Iso-Dream的新型强化学习方法,它从两个方面改进了梦想到控制的框架。首先,通过优化反向动态,我们鼓励世界模型学习在孤立的州过渡分支上可控和不可控制的空间变化的来源。第二,我们优化了代理人在脱钩的世界模型潜在想象力上的行为。具体地说,为了估算国家价值,我们将不可控状态推广到未来,并将它们与当前可控状态联系起来。这样,动态源的孤立可以极大地有利于代理人的长期同步决策,例如,通过预测其他飞行器的移动可以避免潜在风险的自我驱动汽车。实验表明,Iso-Dream在将混合的动态和清晰的视野范围外的视觉控制方法中可以有效脱钩。

0

相关内容

控制器

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

石墨烯叠层遂穿异质结的超宽波段光电转换机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

分子光开关用于嵌段共聚物自组装纳米结构的超分辨荧光成像

国家自然科学基金

0+阅读 · 2014年12月31日

基于手性无机纳米晶自组装构建多尺度手性纳米结构的研究

国家自然科学基金

0+阅读 · 2014年12月31日

结晶和反应结晶过程的介尺度机理研究和模拟调控

国家自然科学基金

0+阅读 · 2013年12月31日

活性软材料中高强度聚焦超声波传播特性及其辐射力效应研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向DDS的自驱动Pt纳米机器人运动控制机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

复合稀土层状氢氧化物的可控合成、剥离及透明荧光取向膜的纳米片组装与光学特性

国家自然科学基金

0+阅读 · 2011年12月31日

中介尺度活塞式内燃机强时变性微燃烧过程的测试与机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

PGRMC1蛋白在肾癌中的功能及作用机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models

Arxiv

0+阅读 · 2022年7月15日

The edge of discovery: Controlling the local false discovery rate at the margin

Arxiv

0+阅读 · 2022年7月15日

On the Strong Correlation Between Model Invariance and Generalization

On the Strong Correlation Between Model Invariance and Generalization

Arxiv

0+阅读 · 2022年7月14日

Factorized and Controllable Neural Re-Rendering of Outdoor Scene for Photo Extrapolation

Arxiv

0+阅读 · 2022年7月14日

Ranking and Tuning Pre-trained Models: A New Paradigm for Exploiting Model Hubs

Arxiv

0+阅读 · 2022年7月14日

Unsupervised Learning for Combinatorial Optimization with Principled Objective Relaxation

Arxiv

0+阅读 · 2022年7月14日

Explainable Artificial Intelligence for Autonomous Driving: A Comprehensive Overview and Field Guide for Future Research Directions

Arxiv

18+阅读 · 2021年12月21日

Controllable Multi-Interest Framework for Recommendation

Arxiv

18+阅读 · 2020年8月3日

Latent Relation Language Models

Arxiv

21+阅读 · 2019年8月21日

f-VAEGAN-D2: A Feature Generating Framework for Any-Shot Learning

Arxiv

11+阅读 · 2019年3月25日

VIP会员

文章信息

相关主题

相关VIP内容

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

数据智能体综述：新兴范式还是被高估的炒作？

海底战已至：美国构思海底安全战略 | 最新报告

【ICCV2025教程】视觉异常检测中的基础模型：进展、挑战与应用

美军将无人自主等新技术融入潜艇部队以更具杀伤力

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models

Arxiv

0+阅读 · 2022年7月15日

The edge of discovery: Controlling the local false discovery rate at the margin

Arxiv

0+阅读 · 2022年7月15日

On the Strong Correlation Between Model Invariance and Generalization

On the Strong Correlation Between Model Invariance and Generalization

Arxiv

0+阅读 · 2022年7月14日

Factorized and Controllable Neural Re-Rendering of Outdoor Scene for Photo Extrapolation

Arxiv

0+阅读 · 2022年7月14日

Ranking and Tuning Pre-trained Models: A New Paradigm for Exploiting Model Hubs

Arxiv

0+阅读 · 2022年7月14日

Unsupervised Learning for Combinatorial Optimization with Principled Objective Relaxation

Arxiv

0+阅读 · 2022年7月14日

Explainable Artificial Intelligence for Autonomous Driving: A Comprehensive Overview and Field Guide for Future Research Directions

Arxiv

18+阅读 · 2021年12月21日

Controllable Multi-Interest Framework for Recommendation

Arxiv

18+阅读 · 2020年8月3日

Latent Relation Language Models

Arxiv

21+阅读 · 2019年8月21日

f-VAEGAN-D2: A Feature Generating Framework for Any-Shot Learning

Arxiv

11+阅读 · 2019年3月25日

相关基金

石墨烯叠层遂穿异质结的超宽波段光电转换机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

分子光开关用于嵌段共聚物自组装纳米结构的超分辨荧光成像

国家自然科学基金

0+阅读 · 2014年12月31日

基于手性无机纳米晶自组装构建多尺度手性纳米结构的研究

国家自然科学基金

0+阅读 · 2014年12月31日

结晶和反应结晶过程的介尺度机理研究和模拟调控

国家自然科学基金

0+阅读 · 2013年12月31日

活性软材料中高强度聚焦超声波传播特性及其辐射力效应研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向DDS的自驱动Pt纳米机器人运动控制机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

复合稀土层状氢氧化物的可控合成、剥离及透明荧光取向膜的纳米片组装与光学特性

国家自然科学基金

0+阅读 · 2011年12月31日

中介尺度活塞式内燃机强时变性微燃烧过程的测试与机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

PGRMC1蛋白在肾癌中的功能及作用机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员