Learning Robotic Manipulation Skills Using an Adaptive Force-Impedance Action Space - Zhuanzhi (专知) Papers

October 19, 2021

Learning Robotic Manipulation Skills Using an Adaptive Force-Impedance Action Space

Maximilian Ulmer, Elie Aljalbout, Sascha Schwarz, Sami Haddadin

Intelligent agents must be able to think fast and slow to perform elaborate manipulation tasks. Reinforcement Learning (RL) has led to many promising results on a range of challenging decision-making tasks. However, in real-world robotics, these methods still struggle, as they require large amounts of expensive interactions and have slow feedback loops. On the other hand, fast human-like adaptive control methods can optimize complex robotic interactions, yet fail to integrate multimodal feedback needed for unstructured tasks. In this work, we propose to factor the learning problem in a hierarchical learning and adaption architecture to get the best of both worlds. The framework consists of two components, a slow reinforcement learning policy optimizing the task strategy given multimodal observations, and a fast, real-time adaptive control policy continuously optimizing the motion, stability, and effort of the manipulator. We combine these components through a bio-inspired action space that we call AFORCE. We demonstrate the new action space on a contact-rich manipulation task on real hardware and evaluate its performance on three simulated manipulation tasks. Our experiments show that AFORCE drastically improves sample efficiency while reducing energy consumption and improving safety.
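The two-rate structure described in the abstract can be illustrated with a toy example. The following is a minimal sketch under stated assumptions, not the authors' AFORCE implementation: the abstract gives no equations, so the 1-D point mass, the adaptation law, all gains, the loop rates, and the names `AdaptiveImpedance`, `slow_policy`, and `run` are hypothetical. A stand-in "slow" policy picks a position setpoint every 100 control steps, while a "fast" adaptive impedance law updates its stiffness and feedforward force at every step to track that setpoint under a constant disturbance.

```python
from dataclasses import dataclass


@dataclass
class AdaptiveImpedance:
    """Fast loop: hypothetical 1-D adaptive impedance law (not the paper's controller)."""
    stiffness: float = 400.0       # N/m, adapted online
    feedforward: float = 0.0       # N, adapted online
    damping: float = 40.0          # N*s/m, kept fixed for simplicity
    stiffness_gain: float = 5000.0  # assumed adaptation rate for stiffness
    force_gain: float = 1000.0      # assumed adaptation rate for feedforward force
    forget: float = 0.1             # assumed forgetting factor that relaxes effort

    def step(self, x: float, v: float, x_des: float, dt: float) -> float:
        e = x_des - x
        # Grow stiffness and feedforward with tracking error, let them decay otherwise.
        self.stiffness += dt * (self.stiffness_gain * e * e - self.forget * self.stiffness)
        self.stiffness = max(self.stiffness, 0.0)
        self.feedforward += dt * (self.force_gain * e - self.forget * self.feedforward)
        return self.stiffness * e - self.damping * v + self.feedforward


def slow_policy(observation: dict) -> float:
    """Slow loop: stand-in for a learned RL policy; returns a position setpoint in m."""
    return 0.1 if observation["t"] < 1.0 else 0.2


def run(duration=2.0, dt=0.001, policy_every=100, mass=1.0, disturbance=-2.0):
    """Simulate a point mass driven by the slow policy and the fast controller."""
    ctrl = AdaptiveImpedance()
    x, v, x_des = 0.0, 0.0, 0.0
    policy_calls = 0
    for k in range(int(round(duration / dt))):
        if k % policy_every == 0:            # slow loop (10 Hz with these defaults)
            x_des = slow_policy({"t": k * dt, "x": x})
            policy_calls += 1
        force = ctrl.step(x, v, x_des, dt)   # fast loop (1 kHz with these defaults)
        v += (force + disturbance) / mass * dt  # semi-implicit Euler integration
        x += v * dt
    return x, x_des, policy_calls, ctrl


if __name__ == "__main__":
    x, x_des, calls, ctrl = run()
    print(f"final position {x:.4f} m, setpoint {x_des:.1f} m, policy calls {calls}")
    print(f"adapted stiffness {ctrl.stiffness:.1f} N/m, feedforward {ctrl.feedforward:.2f} N")
```

In this sketch the policy is queried only 20 times over 2000 control steps, which mirrors the division of labor in the abstract: the slow component decides what to do, and the fast component handles tracking, stability, and effort between decisions.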

