使用机械手臂作为增强腿部运动稳定性的尾巴 (More Than an Arm: Using a Manipulator as a Tail for Enhanced Stability in Legged Locomotion) - 专知论文

会员服务 ·

0

机械手 · 控制器 · 机器人 · 协同作用 · 姿态控制 ·

2023 年 5 月 2 日

More Than an Arm: Using a Manipulator as a Tail for Enhanced Stability in Legged Locomotion

翻译：使用机械手臂作为增强腿部运动稳定性的尾巴

Huang Huang,Antonio Loquercio,Ashish Kumar,Neerja Thakkar,Ken Goldberg,Jitendra Malik

Is a manipulator on a legged robot a liability or an asset for locomotion? Prior works mainly designed specific controllers to account for the added payload and inertia from a manipulator. In contrast, biological systems typically benefit from additional limbs, which can simplify postural control. For instance, cats use their tails to enhance the stability of their bodies and prevent falls under disturbances. In this work, we show that a manipulator can be an important asset for maintaining balance during locomotion. To do so, we train a sensorimotor policy using deep reinforcement learning to create a synergy between the robot's limbs. This policy enables the robot to maintain stability despite large disturbances. However, learning such a controller can be quite challenging. To account for these challenges, we propose a stage-wise training procedure to learn complex behaviors. Our proposed method decomposes this complex task into three stages and then incrementally learns these tasks to arrive at a single policy capable of solving the final control task, achieving a success rate up to 2.35 times higher than baselines in simulation. We deploy our learned policy in the real world and show stability during locomotion under strong disturbances.

翻译：在机器人的运动中，机械手臂是负担还是资产？以前的研究主要设计特定的控制器来考虑来自机械手臂的额外负载和惯性。相比之下，生物系统通常受益于额外的肢体，这可以简化姿态控制。例如，猫使用它们的尾巴来增强身体的稳定性并防止在干扰下跌倒。在这项工作中，我们展示了在运动过程中机械手臂可以是保持平衡的重要资产。为此，我们使用深度强化学习训练感觉运动策略来创建机器人四肢之间的协同作用。该策略使机器人在大干扰下保持稳定。然而，学习这样的控制器可以是相当具有挑战性的。为了解决这些挑战，我们提出了一个分阶段训练程序来学习复杂的行为。我们提出的方法将这个复杂的任务分解成三个阶段，然后逐步学习这些任务，从而实现一个能够解决最终控制任务的单一策略，在仿真中实现了高达基线的2.35倍成功率。我们将学习到的策略部署在现实世界中，显示在强烈干扰下的运动稳定性。

0

相关内容

机械手

【硬核书】规划算法 (Planning Algorithm)，1023页pdf，Steven M. Illinois大学

【硬核书】规划算法 (Planning Algorithm)，1023页pdf，Steven M. Illinois大学

专知会员服务

167+阅读 · 2022年4月10日

【Hugging Face】使用自定义数据集微调语义分割模型，Fine-Tune a Semantic Segmentation Model with a Custom Dataset

【Hugging Face】使用自定义数据集微调语义分割模型，Fine-Tune a Semantic Segmentation Model with a Custom Dataset

专知会员服务

21+阅读 · 2022年3月18日

【AAAI2022】一种基于状态扰动的鲁棒强化学习算法

【AAAI2022】一种基于状态扰动的鲁棒强化学习算法

专知会员服务

36+阅读 · 2022年1月31日

《行为与认知机器人学》，241页pdf

《行为与认知机器人学》，241页pdf

专知会员服务

54+阅读 · 2021年4月11日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【斯坦福大学】Gradient Surgery for Multi-Task Learning

【斯坦福大学】Gradient Surgery for Multi-Task Learning

专知会员服务

47+阅读 · 2020年1月23日

【O'Reilly AI Conference 2019】使用GPU和Docker容器进行Horovod和Spark深度学习（Deep learning with Horovod and Spark using GPUs and Docker containers），BlueData的联合创始人兼首席架构师Thomas Phelan

【O'Reilly AI Conference 2019】使用GPU和Docker容器进行Horovod和Spark深度学习（Deep learning with Horovod and Spark using GPUs and Docker containers），BlueData的联合创始人兼首席架构师Thomas Phelan

专知会员服务

21+阅读 · 2019年11月5日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

只需1次演示，1小时在线训练，机器人真就做到看一遍就会了

只需1次演示，1小时在线训练，机器人真就做到看一遍就会了

机器之心

1+阅读 · 2022年7月15日

强化学习在机器人中的应用，附视频与Slides，Animesh Garg, UoT

强化学习在机器人中的应用，附视频与Slides，Animesh Garg, UoT

专知

2+阅读 · 2022年7月12日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

泡泡机器人SLAM

11+阅读 · 2019年5月22日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

神经反馈康复训练的反馈策略和控制方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

水凝胶携带NgR1沉默的神经干细胞移植治疗脊髓损伤的研究

国家自然科学基金

0+阅读 · 2014年12月31日

不确定条件下基于分群策略的柔性Flow Shop调度问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向异构环境自主巡航的仿人机器人运动规划及多足平台推广研究

国家自然科学基金

0+阅读 · 2013年12月31日

四足哺乳动物疾驰机理若干问题

国家自然科学基金

0+阅读 · 2012年12月31日

基于对运动神经元智能探索的新型自适应学习控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

环境诱导家蚕滞育的CREB调控机制

国家自然科学基金

0+阅读 · 2011年12月31日

Pharicin B稳定维甲酸受体的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

室温低压稳定半笼形水合物的合成、结构调控与性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

微分对策数值解法及非线性系统Min-Max鲁棒后退时域控制算法研究

国家自然科学基金

0+阅读 · 2009年12月31日

Sample-Efficient On-Policy Imitation Learning from Observations

Arxiv

0+阅读 · 2023年6月16日

Residual Q-Learning: Offline and Online Policy Customization without Value

Arxiv

0+阅读 · 2023年6月15日

Tell Me Where to Go: A Composable Framework for Context-Aware Embodied Robot Navigation

Arxiv

0+阅读 · 2023年6月15日

A Framework for Learning from Demonstration with Minimal Human Effort

Arxiv

0+阅读 · 2023年6月15日

Language to Rewards for Robotic Skill Synthesis

Arxiv

0+阅读 · 2023年6月14日

The Quality-Diversity Transformer: Generating Behavior-Conditioned Trajectories with Decision Transformers

Arxiv

0+阅读 · 2023年6月14日

Expanding Versatility of Agile Locomotion through Policy Transitions Using Latent State Representation

Arxiv

0+阅读 · 2023年6月14日

Learning When to Ask for Help: Transferring Human Knowledge through Part-Time Demonstration

Arxiv

0+阅读 · 2023年6月14日

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Arxiv

19+阅读 · 2022年5月13日

Rethinking Knowledge Graph Propagation for Zero-Shot Learning

Rethinking Knowledge Graph Propagation for Zero-Shot Learning

Arxiv

21+阅读 · 2019年3月27日

VIP会员

文章信息

相关主题

相关VIP内容

【硬核书】规划算法 (Planning Algorithm)，1023页pdf，Steven M. Illinois大学

【硬核书】规划算法 (Planning Algorithm)，1023页pdf，Steven M. Illinois大学

专知会员服务

167+阅读 · 2022年4月10日

【Hugging Face】使用自定义数据集微调语义分割模型，Fine-Tune a Semantic Segmentation Model with a Custom Dataset

【Hugging Face】使用自定义数据集微调语义分割模型，Fine-Tune a Semantic Segmentation Model with a Custom Dataset

专知会员服务

21+阅读 · 2022年3月18日

【AAAI2022】一种基于状态扰动的鲁棒强化学习算法

【AAAI2022】一种基于状态扰动的鲁棒强化学习算法

专知会员服务

36+阅读 · 2022年1月31日

《行为与认知机器人学》，241页pdf

《行为与认知机器人学》，241页pdf

专知会员服务

54+阅读 · 2021年4月11日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【斯坦福大学】Gradient Surgery for Multi-Task Learning

【斯坦福大学】Gradient Surgery for Multi-Task Learning

专知会员服务

47+阅读 · 2020年1月23日

【O'Reilly AI Conference 2019】使用GPU和Docker容器进行Horovod和Spark深度学习（Deep learning with Horovod and Spark using GPUs and Docker containers），BlueData的联合创始人兼首席架构师Thomas Phelan

【O'Reilly AI Conference 2019】使用GPU和Docker容器进行Horovod和Spark深度学习（Deep learning with Horovod and Spark using GPUs and Docker containers），BlueData的联合创始人兼首席架构师Thomas Phelan

专知会员服务

21+阅读 · 2019年11月5日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《太空边缘（临近空间）的武器化？军事高空平台的进展与前景》

《利用星基增强系统（SBAS）信号进行射频干扰（RFI）检测与特征分析》

美陆军在“艾布拉姆斯”坦克与“布拉德利”步战车上测试“牛蛙”反无人机炮塔

《军事领域特性及其对军事人工智能应用的影响》

相关资讯

只需1次演示，1小时在线训练，机器人真就做到看一遍就会了

只需1次演示，1小时在线训练，机器人真就做到看一遍就会了

机器之心

1+阅读 · 2022年7月15日

强化学习在机器人中的应用，附视频与Slides，Animesh Garg, UoT

强化学习在机器人中的应用，附视频与Slides，Animesh Garg, UoT

专知

2+阅读 · 2022年7月12日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

【泡泡一分钟】FarSight：从户外图像中实现远距离深度估计

泡泡机器人SLAM

11+阅读 · 2019年5月22日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

相关论文

Sample-Efficient On-Policy Imitation Learning from Observations

Arxiv

0+阅读 · 2023年6月16日

Residual Q-Learning: Offline and Online Policy Customization without Value

Arxiv

0+阅读 · 2023年6月15日

Tell Me Where to Go: A Composable Framework for Context-Aware Embodied Robot Navigation

Arxiv

0+阅读 · 2023年6月15日

A Framework for Learning from Demonstration with Minimal Human Effort

Arxiv

0+阅读 · 2023年6月15日

Language to Rewards for Robotic Skill Synthesis

Arxiv

0+阅读 · 2023年6月14日

The Quality-Diversity Transformer: Generating Behavior-Conditioned Trajectories with Decision Transformers

Arxiv

0+阅读 · 2023年6月14日

Expanding Versatility of Agile Locomotion through Policy Transitions Using Latent State Representation

Arxiv

0+阅读 · 2023年6月14日

Learning When to Ask for Help: Transferring Human Knowledge through Part-Time Demonstration

Arxiv

0+阅读 · 2023年6月14日

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Arxiv

19+阅读 · 2022年5月13日

Rethinking Knowledge Graph Propagation for Zero-Shot Learning

Rethinking Knowledge Graph Propagation for Zero-Shot Learning

Arxiv

21+阅读 · 2019年3月27日

相关基金

神经反馈康复训练的反馈策略和控制方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

水凝胶携带NgR1沉默的神经干细胞移植治疗脊髓损伤的研究

国家自然科学基金

0+阅读 · 2014年12月31日

不确定条件下基于分群策略的柔性Flow Shop调度问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

面向异构环境自主巡航的仿人机器人运动规划及多足平台推广研究

国家自然科学基金

0+阅读 · 2013年12月31日

四足哺乳动物疾驰机理若干问题

国家自然科学基金

0+阅读 · 2012年12月31日

基于对运动神经元智能探索的新型自适应学习控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

环境诱导家蚕滞育的CREB调控机制

国家自然科学基金

0+阅读 · 2011年12月31日

Pharicin B稳定维甲酸受体的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

室温低压稳定半笼形水合物的合成、结构调控与性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

微分对策数值解法及非线性系统Min-Max鲁棒后退时域控制算法研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员