Developing robust vision-guided controllers for quadrupedal robots in complex environments, with various obstacles, dynamic surroundings, and uneven terrain, is very challenging. While Reinforcement Learning (RL) provides a promising paradigm for learning agile locomotion skills with visual inputs in simulation, deploying the learned RL policy in the real world remains difficult. Our key insight is that, aside from the domain gap in visual appearance between simulation and the real world, the latency of the control pipeline is also a major source of difficulty. In this paper, we propose Multi-Modal Delay Randomization (MMDR) to address this issue when training RL agents. Specifically, we simulate the latency of real hardware by using past observations, sampled with randomized periods, for both proprioception and vision. We train the RL policy for end-to-end control in a physical simulator without any predefined controller or reference motion, and deploy it directly on a real A1 quadruped robot running in the wild. We evaluate our method in different outdoor environments with complex terrain and obstacles. We demonstrate that the robot can smoothly maneuver at high speed, avoid obstacles, and significantly outperform the baselines. Our project page with videos is at https://mehooz.github.io/mmdr-wild/.
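To make the core idea concrete, below is a minimal sketch of delay randomization as described above: each modality keeps a short history of past observations, and the policy input is sampled from that history with a freshly randomized delay at every step. All names (DelayedObsBuffer, max_delay_steps, the specific delay ranges) are illustrative assumptions, not the authors' actual implementation.

```python
# Sketch of Multi-Modal Delay Randomization (MMDR): feed the policy
# *past* observations, sampled with randomized delays, to mimic the
# latency of the real control pipeline. Names and delay ranges here
# are hypothetical, chosen only for illustration.
import random
from collections import deque

class DelayedObsBuffer:
    """Keeps a short history of one modality and serves stale samples."""

    def __init__(self, max_delay_steps):
        self.max_delay_steps = max_delay_steps
        # One extra slot so we can look back the full max_delay_steps.
        self.history = deque(maxlen=max_delay_steps + 1)

    def push(self, obs):
        self.history.append(obs)

    def sample_delayed(self):
        # Randomize the delay on every query; clamp to the history we have.
        delay = random.randint(0, min(self.max_delay_steps,
                                      len(self.history) - 1))
        # history[-1] is the newest sample; step back by `delay` frames.
        return self.history[-1 - delay]

# Proprioception typically updates faster than vision on real hardware,
# so each modality gets its own buffer and delay range (values made up).
proprio_buf = DelayedObsBuffer(max_delay_steps=4)
vision_buf = DelayedObsBuffer(max_delay_steps=2)

def delayed_policy_input(proprio_obs, depth_image):
    """Push fresh simulator readings, return latency-randomized inputs."""
    proprio_buf.push(proprio_obs)
    vision_buf.push(depth_image)
    return proprio_buf.sample_delayed(), vision_buf.sample_delayed()
```

Sampling the two modalities independently reflects that proprioceptive and visual streams arrive with different, uncorrelated latencies on the physical robot.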