Incremental Learning of Probabilistic Movement Primitives (ProMPs) for Human-Robot Cooperation - Zhuanzhi Papers

May 28, 2021

Incremental Learning of Probabilistic Movement Primitives (ProMPs) for Human-Robot Cooperation

Daniel Schäle,Martin F. Stoelen,Erik Kyrkjebø

From arXiv. This work has been submitted to IROS 2021.

For a successful deployment of physical Human-Robot Cooperation (pHRC), humans need to be able to teach robots new motor skills quickly. Probabilistic movement primitives (ProMPs) are a promising method to encode a robot's motor skills learned from human demonstrations in pHRC settings. However, most algorithms to learn ProMPs from human demonstrations operate in batch mode, which is not ideal in pHRC. In this paper we propose a new learning algorithm to learn ProMPs incrementally in pHRC settings. Our algorithm incorporates new demonstrations sequentially as they arrive, allowing humans to observe the robot's learning progress and incrementally shape the robot's motor skill. A built-in forgetting factor allows for corrective demonstrations resulting from the human's learning curve or changes in task constraints. We compare the performance of our algorithm to existing batch ProMP algorithms on reference data generated from a pick-and-place task at our lab. Furthermore, we show in a proof-of-concept study on a Franka Emika Panda how the forgetting factor allows us to adapt to changes in the task. The incremental learning algorithm presented in this paper has the potential to lead to a more intuitive learning process and to establish a successful cooperation between human and robot faster than training in batch mode.
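The abstract does not state the update equations, so the following is only a minimal sketch of what sequential ProMP learning with a forgetting factor could look like, not the authors' algorithm. It assumes a single degree of freedom, normalized Gaussian basis functions, ridge regression to obtain one weight vector per demonstration, and an exponentially weighted update of the weight mean and covariance. All names (`rbf_features`, `fit_weights`, `IncrementalProMP`, `forgetting`) and all parameter values are hypothetical.

```python
import numpy as np


def rbf_features(n_steps, n_basis=8, width=0.02):
    """Normalized Gaussian basis functions over the phase z in [0, 1]."""
    z = np.linspace(0.0, 1.0, n_steps)[:, None]        # shape (T, 1)
    centers = np.linspace(0.0, 1.0, n_basis)[None, :]  # shape (1, K)
    phi = np.exp(-0.5 * (z - centers) ** 2 / width)
    return phi / phi.sum(axis=1, keepdims=True)        # shape (T, K)


def fit_weights(trajectory, phi, ridge=1e-6):
    """Ridge-regression weights that reproduce one demonstration."""
    a = phi.T @ phi + ridge * np.eye(phi.shape[1])
    return np.linalg.solve(a, phi.T @ trajectory)


class IncrementalProMP:
    """Assumed model: Gaussian over basis weights, updated one demo at a time."""

    def __init__(self, n_basis=8, forgetting=0.5):
        self.n_basis = n_basis
        self.forgetting = forgetting  # in (0, 1]; larger forgets old demos faster
        self.mean = None
        self.cov = np.zeros((n_basis, n_basis))

    def update(self, trajectory):
        """Incorporate a single new demonstration."""
        trajectory = np.asarray(trajectory, dtype=float)
        phi = rbf_features(len(trajectory), self.n_basis)
        w = fit_weights(trajectory, phi)
        if self.mean is None:
            self.mean = w  # first demonstration initializes the mean
            return
        lam = self.forgetting
        delta = w - self.mean
        # Exponentially weighted mean and covariance (an assumption, see lead-in).
        self.mean = self.mean + lam * delta
        self.cov = (1.0 - lam) * (self.cov + lam * np.outer(delta, delta))

    def mean_trajectory(self, n_steps):
        """Mean trajectory encoded by the current weight distribution."""
        return rbf_features(n_steps, self.n_basis) @ self.mean


# Usage: three demonstrations of an old task, then ten corrective ones.
phase = np.linspace(0.0, 1.0, 50)
model = IncrementalProMP(n_basis=8, forgetting=0.5)
for _ in range(3):
    model.update(0.5 * phase)   # old task: ramp that ends near 0.5
for _ in range(10):
    model.update(1.0 * phase)   # changed task: ramp that ends near 1.0
end_point = model.mean_trajectory(50)[-1]

# Batch fit on the new task alone, used as a reference end point.
phi_ref = rbf_features(50, 8)
reference_end = (phi_ref @ fit_weights(phase, phi_ref))[-1]
```

In this sketch the influence of the old demonstrations decays geometrically with each corrective demonstration, so `end_point` moves toward `reference_end`. A batch estimate over all thirteen demonstrations would instead weight the old and new task equally.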

