混合系统神经控制与区域吸引力规划器 (Hybrid Systems Neural Control with Region-of-Attraction Planner) - 专知论文

会员服务 ·

0

控制器 · Lyapunov · 峰值 · Learning · Continuity ·

2023 年 3 月 18 日

Hybrid Systems Neural Control with Region-of-Attraction Planner

翻译：混合系统神经控制与区域吸引力规划器

Yue Meng,Chuchu Fan

from arxiv, Accepted to L4DC2023

Hybrid systems are prevalent in robotics. However, ensuring the stability of hybrid systems is challenging due to sophisticated continuous and discrete dynamics. A system with all its system modes stable can still be unstable. Hence special treatments are required at mode switchings to stabilize the system. In this work, we propose a hierarchical, neural network (NN)-based method to control general hybrid systems. For each system mode, we first learn an NN Lyapunov function and an NN controller to ensure the states within the region of attraction (RoA) can be stabilized. Then an RoA NN estimator is learned across different modes. Upon mode switching, we propose a differentiable planner to ensure the states after switching can land in next mode's RoA, hence stabilizing the hybrid system. We provide novel theoretical stability guarantees and conduct experiments in car tracking control, pogobot navigation, and bipedal walker locomotion. Our method only requires 0.25X of the training time as needed by other learning-based methods. With low running time (10-50X faster than model predictive control (MPC)), our controller achieves a higher stability/success rate over other baselines such as MPC, reinforcement learning (RL), common Lyapunov methods (CLF), linear quadratic regulator (LQR), quadratic programming (QP) and Hamilton-Jacobian-based methods (HJB). The project page is on https://mit-realm.github.io/hybrid-clf.

翻译：混合系统在机器人技术中应用广泛。然而，由于其复杂的连续和离散动态，在确保混合系统的稳定性方面具有挑战性。即使一个系统的所有系统模式都是稳定的，它仍然可能不稳定，在模式转换时需要特殊处理来稳定系统。在本研究中，我们提出了一种基于神经网络（NN）的分层方法来控制一般混合系统。针对每个系统模式，我们首先学习一个NN李雅普诺夫函数和一个控制器，以确保可以稳定处于吸引子区域（RoA）内的状态。然后学习一个RoA NN估计器来跨越不同模式进行估计。在模式切换时，提出了一个可微分的规划器，以确保切换后的状态可以落在下一个模式的RoA中，从而稳定混合系统。我们提供了新颖的理论稳定性保证，并在汽车跟踪控制，pogobot导航和双足行走器运动等领域进行了实验。我们的方法只需要其他基于学习的方法所需的0.25倍的训练时间。在运行时间较短（比模型预测控制（MPC）快10-50倍）的情况下，我们的控制器在其他基线（如MPC、强化学习（RL）、常见李雅普诺夫方法（CLF）、线性二次调节器（LQR）、二次规划（QP）和基于哈密尔顿-雅各比方法（HJB）的控制器）上实现了更高的稳定性/成功率。项目页面位于https://mit-realm.github.io/hybrid-clf。

0

相关内容

控制器

【开放书】设计机器学习系统，Designing Machine Learning Systems

【开放书】设计机器学习系统，Designing Machine Learning Systems

专知会员服务

77+阅读 · 2022年5月17日

【硬核书】规划算法 (Planning Algorithm)，1023页pdf，Steven M. Illinois大学

【硬核书】规划算法 (Planning Algorithm)，1023页pdf，Steven M. Illinois大学

专知会员服务

167+阅读 · 2022年4月10日

【CVPR2022】多机器人协同主动建图算法

【CVPR2022】多机器人协同主动建图算法

专知会员服务

49+阅读 · 2022年4月3日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

最新《对抗机器学习》报告，EPFL-Volkan教授讲解AML中的优化问题

最新《对抗机器学习》报告，EPFL-Volkan教授讲解AML中的优化问题

专知会员服务

36+阅读 · 2021年1月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

经典书《斯坦福大学-多智能体系统》532页pdf，MULTIAGENT SYSTEMS Algorithmic, Game-Theoretic, and Logical Foundations

经典书《斯坦福大学-多智能体系统》532页pdf，MULTIAGENT SYSTEMS Algorithmic, Game-Theoretic, and Logical Foundations

专知会员服务

158+阅读 · 2020年1月29日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

17种深度强化学习算法用Pytorch实现

17种深度强化学习算法用Pytorch实现

新智元

31+阅读 · 2019年9月16日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【泡泡一分钟】从三维流动中学习单目视觉里程计及三维稠密建图

【泡泡一分钟】从三维流动中学习单目视觉里程计及三维稠密建图

泡泡机器人SLAM

12+阅读 · 2019年2月12日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

安装TensorFlow 2.0 preview进行深度学习（附Jupyter Notebook）

安装TensorFlow 2.0 preview进行深度学习（附Jupyter Notebook）

专知

10+阅读 · 2019年1月11日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

多层时空并行 Schwarz 算法的研究

国家自然科学基金

3+阅读 · 2017年12月31日

基于正交投影迭代学习的高频响直线伺服系统重复性扰动辨识研究

国家自然科学基金

0+阅读 · 2013年12月31日

控制方向未知的随机非线性系统的神经网络自适应控制

国家自然科学基金

2+阅读 · 2013年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

有限时间时滞混沌同步及其FPGA 实现

国家自然科学基金

0+阅读 · 2012年12月31日

面向属性的CPN建模及On the Fly辅助的测试生成方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

混合系统优化控制问题研究

国家自然科学基金

0+阅读 · 2011年12月31日

不确定随机非线性系统的自适应动态面控制研究

国家自然科学基金

0+阅读 · 2011年12月31日

图像处理问题的快速数值方法

国家自然科学基金

1+阅读 · 2008年12月31日

非线性不连续系统的稳定与镇定

国家自然科学基金

0+阅读 · 2008年12月31日

Neural Lyapunov Control for Discrete-Time Systems

Arxiv

0+阅读 · 2023年5月11日

'Put the Car on the Stand': SMT-based Oracles for Investigating Decisions

Arxiv

0+阅读 · 2023年5月9日

Pseudo-Hamiltonian system identification

Arxiv

0+阅读 · 2023年5月9日

Adaptive Localized Reduced Basis Methods for Large Scale Parameterized Systems

Arxiv

0+阅读 · 2023年5月9日

Graph Neural Network-based surrogate model for granular flows

Arxiv

0+阅读 · 2023年5月9日

A Comprehensive Survey on Multimodal Recommender Systems: Taxonomy, Evaluation, and Future Directions

Arxiv

16+阅读 · 2023年2月9日

NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields

Arxiv

14+阅读 · 2022年3月3日

Rethinking Knowledge Graph Propagation for Zero-Shot Learning

Rethinking Knowledge Graph Propagation for Zero-Shot Learning

Arxiv

21+阅读 · 2019年3月27日

Ripple Network: Propagating User Preferences on the Knowledge Graph for Recommender Systems

Arxiv

12+阅读 · 2018年3月9日

A Survey on Dialogue Systems: Recent Advances and New Frontiers

Arxiv

11+阅读 · 2018年1月11日

VIP会员

文章信息

相关主题

相关VIP内容

【开放书】设计机器学习系统，Designing Machine Learning Systems

【开放书】设计机器学习系统，Designing Machine Learning Systems

专知会员服务

77+阅读 · 2022年5月17日

【硬核书】规划算法 (Planning Algorithm)，1023页pdf，Steven M. Illinois大学

【硬核书】规划算法 (Planning Algorithm)，1023页pdf，Steven M. Illinois大学

专知会员服务

167+阅读 · 2022年4月10日

【CVPR2022】多机器人协同主动建图算法

【CVPR2022】多机器人协同主动建图算法

专知会员服务

49+阅读 · 2022年4月3日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

最新《对抗机器学习》报告，EPFL-Volkan教授讲解AML中的优化问题

最新《对抗机器学习》报告，EPFL-Volkan教授讲解AML中的优化问题

专知会员服务

36+阅读 · 2021年1月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

经典书《斯坦福大学-多智能体系统》532页pdf，MULTIAGENT SYSTEMS Algorithmic, Game-Theoretic, and Logical Foundations

经典书《斯坦福大学-多智能体系统》532页pdf，MULTIAGENT SYSTEMS Algorithmic, Game-Theoretic, and Logical Foundations

专知会员服务

158+阅读 · 2020年1月29日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

数据要素发展报告(2025年)：附下载

人工智能代理提升战时舰船战备水平

【NeurIPS2025教程】大语言模型规划

NeurIPS 2025 教程：深度学习训练不稳定性的理论洞见

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

17种深度强化学习算法用Pytorch实现

17种深度强化学习算法用Pytorch实现

新智元

31+阅读 · 2019年9月16日

灾难性遗忘问题新视角：迁移-干扰平衡

灾难性遗忘问题新视角：迁移-干扰平衡

CreateAMind

17+阅读 · 2019年7月6日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【泡泡一分钟】从三维流动中学习单目视觉里程计及三维稠密建图

【泡泡一分钟】从三维流动中学习单目视觉里程计及三维稠密建图

泡泡机器人SLAM

12+阅读 · 2019年2月12日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

安装TensorFlow 2.0 preview进行深度学习（附Jupyter Notebook）

安装TensorFlow 2.0 preview进行深度学习（附Jupyter Notebook）

专知

10+阅读 · 2019年1月11日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

相关论文

Neural Lyapunov Control for Discrete-Time Systems

Arxiv

0+阅读 · 2023年5月11日

'Put the Car on the Stand': SMT-based Oracles for Investigating Decisions

Arxiv

0+阅读 · 2023年5月9日

Pseudo-Hamiltonian system identification

Arxiv

0+阅读 · 2023年5月9日

Adaptive Localized Reduced Basis Methods for Large Scale Parameterized Systems

Arxiv

0+阅读 · 2023年5月9日

Graph Neural Network-based surrogate model for granular flows

Arxiv

0+阅读 · 2023年5月9日

A Comprehensive Survey on Multimodal Recommender Systems: Taxonomy, Evaluation, and Future Directions

Arxiv

16+阅读 · 2023年2月9日

NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields

Arxiv

14+阅读 · 2022年3月3日

Rethinking Knowledge Graph Propagation for Zero-Shot Learning

Rethinking Knowledge Graph Propagation for Zero-Shot Learning

Arxiv

21+阅读 · 2019年3月27日

Ripple Network: Propagating User Preferences on the Knowledge Graph for Recommender Systems

Arxiv

12+阅读 · 2018年3月9日

A Survey on Dialogue Systems: Recent Advances and New Frontiers

Arxiv

11+阅读 · 2018年1月11日

相关基金

多层时空并行 Schwarz 算法的研究

国家自然科学基金

3+阅读 · 2017年12月31日

基于正交投影迭代学习的高频响直线伺服系统重复性扰动辨识研究

国家自然科学基金

0+阅读 · 2013年12月31日

控制方向未知的随机非线性系统的神经网络自适应控制

国家自然科学基金

2+阅读 · 2013年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

有限时间时滞混沌同步及其FPGA 实现

国家自然科学基金

0+阅读 · 2012年12月31日

面向属性的CPN建模及On the Fly辅助的测试生成方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

混合系统优化控制问题研究

国家自然科学基金

0+阅读 · 2011年12月31日

不确定随机非线性系统的自适应动态面控制研究

国家自然科学基金

0+阅读 · 2011年12月31日

图像处理问题的快速数值方法

国家自然科学基金

1+阅读 · 2008年12月31日

非线性不连续系统的稳定与镇定

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员