带有神经网络的机器人设计、MILP解决方案和积极学习 (Robot Design With Neural Networks, MILP Solvers and Active Learning) - 专知论文

会员服务 ·

0

优化器 · 泛函 · Neural Networks · 学成 · 主动学习 ·

2021 年 2 月 8 日

Robot Design With Neural Networks, MILP Solvers and Active Learning

翻译：带有神经网络的机器人设计、MILP解决方案和积极学习

Sanjai Narain,Emily Mak,Dana Chee,Todd Huster,Jeremy Cohen,Kishore Pochiraju,Brendan Englot,Niraj K. Jha,Karthik Narayan

from arxiv, 22 pages, 8 figures

Central to the design of many robot systems and their controllers is solving a constrained blackbox optimization problem. This paper presents CNMA, a new method of solving this problem that is conservative in the number of potentially expensive blackbox function evaluations; allows specifying complex, even recursive constraints directly rather than as hard-to-design penalty or barrier functions; and is resilient to the non-termination of function evaluations. CNMA leverages the ability of neural networks to approximate any continuous function, their transformation into equivalent mixed integer linear programs (MILPs) and their optimization subject to constraints with industrial strength MILP solvers. A new learning-from-failure step guides the learning to be relevant to solving the constrained optimization problem. Thus, the amount of learning is orders of magnitude smaller than that needed to learn functions over their entire domains. CNMA is illustrated with the design of several robotic systems: wave-energy propelled boat, lunar lander, hexapod, cartpole, acrobot and parallel parking. These range from 6 real-valued dimensions to 36. We show that CNMA surpasses the Nelder-Mead, Gaussian and Random Search optimization methods against the metric of number of function evaluations.

翻译：许多机器人系统及其控制器的设计中心正在解决一个限制的黑盒优化问题。本文展示了CNMA, 这是一种解决这一问题的新方法,在潜在昂贵黑盒功能评估的数量上是保守的; 允许直接具体说明复杂、甚至循环的制约,而不是难以设计的惩罚或屏障功能; 并且能够适应功能评估的不终结。 CNMA利用神经网络的能力来接近任何连续功能,将其转化成等同的混合整形线性程序(MILP),并优化,但受工业实力MILP解决方案的限制。一个新的从失败中学习的步骤引导学习与解决限制的优化问题相关。因此,学习的数量比学习其整个领域功能所需的数量小。 CNMA通过设计若干机器人系统来加以说明:波能驱动船、月球登陆器、六极、马波德、马波尔波尔特、一个crobot和平行停车处。这些系统从6个实际价值层面到36个层面。我们显示CNMA超越了Nlder-Mead, Gaus 和随机优化方法的数量。

0

相关内容

优化器

【经典书】机器学习白话书，97页pdf，Machine Learning for Humans

【经典书】机器学习白话书，97页pdf，Machine Learning for Humans

专知会员服务

87+阅读 · 2021年1月11日

近期必读的 NeurIPS2020 80多篇【图机器学习】相关论文

专知会员服务

54+阅读 · 2020年11月3日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

专知会员服务

74+阅读 · 2020年7月6日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

【新墨西哥大学】深度学习的局限性和缺陷，10页pdf，Deep Learning Limitations and Flaws

【新墨西哥大学】深度学习的局限性和缺陷，10页pdf，Deep Learning Limitations and Flaws

专知会员服务

54+阅读 · 2020年2月5日

【ICLR2020】利用图神经网络进行高效概率逻辑推理，Efficient Probabilistic Logic Reasoning with Graph Neural Networks

【ICLR2020】利用图神经网络进行高效概率逻辑推理，Efficient Probabilistic Logic Reasoning with Graph Neural Networks

专知会员服务

113+阅读 · 2020年1月29日

【Google新论文】Learning Transferable Graph Exploration 附论文下载

【Google新论文】Learning Transferable Graph Exploration 附论文下载

专知会员服务

8+阅读 · 2019年11月4日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

CCF推荐 | 国际会议信息6条

CCF推荐 | 国际会议信息6条

Call4Papers

9+阅读 · 2019年8月13日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

Github项目推荐 | 最优控制、强化学习和运动规划等主题参考文献集锦

Github项目推荐 | 最优控制、强化学习和运动规划等主题参考文献集锦

AI研习社

3+阅读 · 2019年4月21日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

阿里巴巴ET城市大脑

阿里巴巴ET城市大脑

智能交通技术

6+阅读 · 2018年12月23日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Multi-core Fiber and Power-limited Optical Network Topology Optimization with MILP

Arxiv

0+阅读 · 2021年3月31日

Safe and Robust Motion Planning for Dynamic Robotics via Control Barrier Functions

Arxiv

0+阅读 · 2021年3月30日

Binary Graph Neural Networks

Arxiv

1+阅读 · 2021年3月29日

Learning Efficient Constraint Graph Sampling for Robotic Sequential Manipulation

Arxiv

0+阅读 · 2021年3月29日

Bayesian Disturbance Injection: Robust Imitation Learning of Flexible Policies

Arxiv

0+阅读 · 2021年3月27日

Soft Robot Optimal Control Via Reduced Order Finite Element Models

Arxiv

0+阅读 · 2021年3月26日

Learning Discrete Structures for Graph Neural Networks

Arxiv

6+阅读 · 2019年5月17日

Accelerated Methods for Deep Reinforcement Learning

Accelerated Methods for Deep Reinforcement Learning

Arxiv

6+阅读 · 2019年1月10日

Information-Directed Exploration for Deep Reinforcement Learning

Information-Directed Exploration for Deep Reinforcement Learning

Arxiv

5+阅读 · 2018年12月18日

Transfer Learning with Neural AutoML

Arxiv

5+阅读 · 2018年9月11日

VIP会员

文章信息

相关主题

Neural Networks

相关VIP内容

【经典书】机器学习白话书，97页pdf，Machine Learning for Humans

【经典书】机器学习白话书，97页pdf，Machine Learning for Humans

专知会员服务

87+阅读 · 2021年1月11日

近期必读的 NeurIPS2020 80多篇【图机器学习】相关论文

专知会员服务

54+阅读 · 2020年11月3日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

【ICML2020】深度神经网络置信感知学习，Conﬁdence-Aware Learning for Deep Neural Networks

专知会员服务

74+阅读 · 2020年7月6日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

【新墨西哥大学】深度学习的局限性和缺陷，10页pdf，Deep Learning Limitations and Flaws

【新墨西哥大学】深度学习的局限性和缺陷，10页pdf，Deep Learning Limitations and Flaws

专知会员服务

54+阅读 · 2020年2月5日

【ICLR2020】利用图神经网络进行高效概率逻辑推理，Efficient Probabilistic Logic Reasoning with Graph Neural Networks

【ICLR2020】利用图神经网络进行高效概率逻辑推理，Efficient Probabilistic Logic Reasoning with Graph Neural Networks

专知会员服务

113+阅读 · 2020年1月29日

【Google新论文】Learning Transferable Graph Exploration 附论文下载

【Google新论文】Learning Transferable Graph Exploration 附论文下载

专知会员服务

8+阅读 · 2019年11月4日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《美陆军特种作战条令》最新102页

《洛克希德SR-71“黑鸟”侦察机动力系统》21页slides

美空军作战实验室通过人工智能和指挥控制技术创新推进杀伤链

《指挥控制能力分析方法论》最新报告

相关资讯

CCF推荐 | 国际会议信息6条

CCF推荐 | 国际会议信息6条

Call4Papers

9+阅读 · 2019年8月13日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

Github项目推荐 | 最优控制、强化学习和运动规划等主题参考文献集锦

Github项目推荐 | 最优控制、强化学习和运动规划等主题参考文献集锦

AI研习社

3+阅读 · 2019年4月21日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

阿里巴巴ET城市大脑

阿里巴巴ET城市大脑

智能交通技术

6+阅读 · 2018年12月23日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Multi-core Fiber and Power-limited Optical Network Topology Optimization with MILP

Arxiv

0+阅读 · 2021年3月31日

Safe and Robust Motion Planning for Dynamic Robotics via Control Barrier Functions

Arxiv

0+阅读 · 2021年3月30日

Binary Graph Neural Networks

Arxiv

1+阅读 · 2021年3月29日

Learning Efficient Constraint Graph Sampling for Robotic Sequential Manipulation

Arxiv

0+阅读 · 2021年3月29日

Bayesian Disturbance Injection: Robust Imitation Learning of Flexible Policies

Arxiv

0+阅读 · 2021年3月27日

Soft Robot Optimal Control Via Reduced Order Finite Element Models

Arxiv

0+阅读 · 2021年3月26日

Learning Discrete Structures for Graph Neural Networks

Arxiv

6+阅读 · 2019年5月17日

Accelerated Methods for Deep Reinforcement Learning

Accelerated Methods for Deep Reinforcement Learning

Arxiv

6+阅读 · 2019年1月10日

Information-Directed Exploration for Deep Reinforcement Learning

Information-Directed Exploration for Deep Reinforcement Learning

Arxiv

5+阅读 · 2018年12月18日

Transfer Learning with Neural AutoML

Arxiv

5+阅读 · 2018年9月11日

微信扫码咨询专知VIP会员