DMAP: 以变化体学习Locomot的分布式道德关注政策 (DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body) - 专知论文

会员服务 ·

0

Agent · Learning · 控制器 · Performer · Attention ·

2022 年 9 月 28 日

DMAP: a Distributed Morphological Attention Policy for Learning to Locomote with a Changing Body

翻译：DMAP: 以变化体学习Locomot的分布式道德关注政策

Alberto Silvio Chiappa,Alessandro Marin Vargas,Alexander Mathis

Biological and artificial agents need to deal with constant changes in the real world. We study this problem in four classical continuous control environments, augmented with morphological perturbations. Learning to locomote when the length and the thickness of different body parts vary is challenging, as the control policy is required to adapt to the morphology to successfully balance and advance the agent. We show that a control policy based on the proprioceptive state performs poorly with highly variable body configurations, while an (oracle) agent with access to a learned encoding of the perturbation performs significantly better. We introduce DMAP, a biologically-inspired, attention-based policy network architecture. DMAP combines independent proprioceptive processing, a distributed policy with individual controllers for each joint, and an attention mechanism, to dynamically gate sensory information from different body parts to different controllers. Despite not having access to the (hidden) morphology information, DMAP can be trained end-to-end in all the considered environments, overall matching or surpassing the performance of an oracle agent. Thus DMAP, implementing principles from biological motor control, provides a strong inductive bias for learning challenging sensorimotor tasks. Overall, our work corroborates the power of these principles in challenging locomotion tasks.

翻译：生物和人工制剂需要应对真实世界的不断变化。我们用四种古典连续控制环境来研究这一问题, 并辅之以形态扰动。当身体各部分的长度和厚度不同时, 学习在滚动, 具有挑战性, 因为控制政策需要适应形态学, 以便成功地平衡和推进物剂。我们显示基于自我感知状态的控制政策在高度变异的体形配置下表现不佳, 而能够接触经学习的扰动编码的( oracle) 剂则表现得更好。我们引入了DMAP, 一种生物激发的、关注型政策网络架构。 DMAP 将独立自觉处理、与每个联合部位的单个控制器的分散政策以及关注机制结合起来, 以动态方式将感官信息从不同身体各部分传到不同的控制器。尽管无法接触( 隐蔽的) 形态信息, 但DMAP可以在所有考虑的环境中接受端到端端端端的训练, 总体匹配或超过一种或触摸物剂的性能。因此, DMAP, 执行生物运动控制的原则, 具有挑战性感官的整个任务。

0

相关内容

Agent

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

67+阅读 · 2020年7月25日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

概率和平均框架下一系列Sobolev空间中的函数逼近与恢复

国家自然科学基金

1+阅读 · 2015年12月31日

青藏高原与东亚季风区云的时空分布和变化

国家自然科学基金

0+阅读 · 2015年12月31日

石墨烯等离子体晶体：能带的非线性调控和格子孤子

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

大扰动下同步发电机非线性模型和参数的研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

具有可见光吸收特性的，基于BODIPY的金属有机骨架材料光催化剂的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于Linked Open Data的Web服务语义互操作关键技术

国家自然科学基金

0+阅读 · 2012年12月31日

海洋吡咯生物碱的设计、合成与活性研究

国家自然科学基金

0+阅读 · 2011年12月31日

离子液体功能化手性Bronsted酸催化剂创制及其在催化反应中的应用

国家自然科学基金

0+阅读 · 2011年12月31日

Leveraging Fully Observable Policies for Learning under Partial Observability

Arxiv

0+阅读 · 2022年11月3日

Embed and Emulate: Learning to estimate parameters of dynamical systems with uncertainty quantification

Arxiv

0+阅读 · 2022年11月3日

An Exponentially Converging Particle Method for the Mixed Nash Equilibrium of Continuous Games

An Exponentially Converging Particle Method for the Mixed Nash Equilibrium of Continuous Games

Arxiv

0+阅读 · 2022年11月2日

Consistent Training via Energy-Based GFlowNets for Modeling Discrete Joint Distributions

Arxiv

0+阅读 · 2022年11月2日

Performance-Oriented Design for Intelligent Reflecting Surface Assisted Federated Learning

Arxiv

0+阅读 · 2022年11月2日

Statistical Learning from Biased Training Samples

Arxiv

0+阅读 · 2022年11月1日

Dungeons and Data: A Large-Scale NetHack Dataset

Arxiv

0+阅读 · 2022年11月1日

Phase-based Ranging in Narrowband Systems with Missing/Interfered Tones

Arxiv

0+阅读 · 2022年11月1日

How to train your solver: Verification of boundary conditions for smoothed particle hydrodynamics

Arxiv

0+阅读 · 2022年11月1日

Formalizing Statistical Causality via Modal Logic

Arxiv

0+阅读 · 2022年11月1日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

67+阅读 · 2020年7月25日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

美陆军五大转型方向

一种Agent自主性风险评估框架 | 最新文献

实时无人机指令处理：一种面向无人机系统的大语言模型方法

基于动态知识图谱的人工智能代理自主研究周期 | 文献

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Leveraging Fully Observable Policies for Learning under Partial Observability

Arxiv

0+阅读 · 2022年11月3日

Embed and Emulate: Learning to estimate parameters of dynamical systems with uncertainty quantification

Arxiv

0+阅读 · 2022年11月3日

An Exponentially Converging Particle Method for the Mixed Nash Equilibrium of Continuous Games

An Exponentially Converging Particle Method for the Mixed Nash Equilibrium of Continuous Games

Arxiv

0+阅读 · 2022年11月2日

Consistent Training via Energy-Based GFlowNets for Modeling Discrete Joint Distributions

Arxiv

0+阅读 · 2022年11月2日

Performance-Oriented Design for Intelligent Reflecting Surface Assisted Federated Learning

Arxiv

0+阅读 · 2022年11月2日

Statistical Learning from Biased Training Samples

Arxiv

0+阅读 · 2022年11月1日

Dungeons and Data: A Large-Scale NetHack Dataset

Arxiv

0+阅读 · 2022年11月1日

Phase-based Ranging in Narrowband Systems with Missing/Interfered Tones

Arxiv

0+阅读 · 2022年11月1日

How to train your solver: Verification of boundary conditions for smoothed particle hydrodynamics

Arxiv

0+阅读 · 2022年11月1日

Formalizing Statistical Causality via Modal Logic

Arxiv

0+阅读 · 2022年11月1日

相关基金

概率和平均框架下一系列Sobolev空间中的函数逼近与恢复

国家自然科学基金

1+阅读 · 2015年12月31日

青藏高原与东亚季风区云的时空分布和变化

国家自然科学基金

0+阅读 · 2015年12月31日

石墨烯等离子体晶体：能带的非线性调控和格子孤子

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

大扰动下同步发电机非线性模型和参数的研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

具有可见光吸收特性的，基于BODIPY的金属有机骨架材料光催化剂的研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于Linked Open Data的Web服务语义互操作关键技术

国家自然科学基金

0+阅读 · 2012年12月31日

海洋吡咯生物碱的设计、合成与活性研究

国家自然科学基金

0+阅读 · 2011年12月31日

离子液体功能化手性Bronsted酸催化剂创制及其在催化反应中的应用

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员