适用于大型彩球和花牌系统的适应性最佳最佳轨迹跟踪控制 (Adaptive Optimal Trajectory Tracking Control Applied to a Large-Scale Ball-on-Plate System) - 专知论文

会员服务 ·

0

控制器 · 优化器 · 可约的 · tuning · 偏移量 ·

2021 年 1 月 25 日

Adaptive Optimal Trajectory Tracking Control Applied to a Large-Scale Ball-on-Plate System

翻译：适用于大型彩球和花牌系统的适应性最佳最佳轨迹跟踪控制

Florian Köpf,Sean Kille,Jairo Inga,Sören Hohmann

from arxiv, F. K\"opf and S. Kille contributed equally to this work. \c{opyright} 2021 IEEE

While many theoretical works concerning Adaptive Dynamic Programming (ADP) have been proposed, application results are scarce. Therefore, we design an ADP-based optimal trajectory tracking controller and apply it to a large-scale ball-on-plate system. Our proposed method incorporates an approximated reference trajectory instead of using setpoint tracking and allows to automatically compensate for constant offset terms. Due to the off-policy characteristics of the algorithm, the method requires only a small amount of measured data to train the controller. Our experimental results show that this tracking mechanism significantly reduces the control cost compared to setpoint controllers. Furthermore, a comparison with a model-based optimal controller highlights the benefits of our model-free data-based ADP tracking controller, where no system model and manual tuning are required but the controller is tuned automatically using measured data.

翻译：虽然提出了许多关于适应动态程序(ADP)的理论工作,但应用结果却很少。因此,我们设计了一个基于ADP的最佳轨迹跟踪控制器,并将其应用到一个大型板球系统。我们提议的方法包含一个近似参考轨迹,而不是使用定点跟踪,并允许自动补偿固定抵消条件。由于算法的非政策性特点,该方法只需要少量测量数据来培训控制器。我们的实验结果显示,与设置点控制器相比,这一跟踪机制大大降低了控制成本。此外,与基于模型的最佳控制器的比较凸显了我们基于无模型的数据的ADP跟踪控制器的好处,不需要系统模型和手动调整,但控制器会自动使用计量数据调整。

0

相关内容

控制器

【经典书】Python金融大数据分析（Yves Hilpsch 著），566页pdf

【经典书】Python金融大数据分析（Yves Hilpsch 著），566页pdf

专知会员服务

97+阅读 · 2021年1月9日

【MIT】反偏差对比学习，Debiased Contrastive Learning

【MIT】反偏差对比学习，Debiased Contrastive Learning

专知会员服务

91+阅读 · 2020年7月4日

【IJCAI2020】基于生成对抗模仿学习的多模态模仿学习算法框架

【IJCAI2020】基于生成对抗模仿学习的多模态模仿学习算法框架

专知会员服务

58+阅读 · 2020年5月26日

【google】监督对比学习，Supervised Contrastive Learning

【google】监督对比学习，Supervised Contrastive Learning

专知会员服务

32+阅读 · 2020年4月23日

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

专知会员服务

136+阅读 · 2020年3月8日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【IJCAI 2019 | tutorial】材料学与AI AI for Materials Science , Lars Kotthof

【IJCAI 2019 | tutorial】材料学与AI AI for Materials Science , Lars Kotthof

专知会员服务

18+阅读 · 2019年8月12日

最前沿：深度解读Soft Actor-Critic 算法

最前沿：深度解读Soft Actor-Critic 算法

极市平台

55+阅读 · 2019年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

已删除

将门创投

3+阅读 · 2019年4月25日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Machine Learning：十大机器学习算法

Machine Learning：十大机器学习算法

开源中国

21+阅读 · 2018年3月1日

推荐｜深度强化学习聊天机器人（附论文）！

推荐｜深度强化学习聊天机器人（附论文）！

全球人工智能

4+阅读 · 2018年1月30日

深度学习医学图像分析文献集

深度学习医学图像分析文献集

机器学习研究会

19+阅读 · 2017年10月13日

Collision Avoidance in Tightly-Constrained Environments without Coordination: a Hierarchical Control Approach

Collision Avoidance in Tightly-Constrained Environments without Coordination: a Hierarchical Control Approach

Arxiv

0+阅读 · 2021年3月18日

An Inextensible Model for Robotic Simulations of Textiles

Arxiv

0+阅读 · 2021年3月17日

Self-Validated Ensemble Models for Design of Experiments

Arxiv

0+阅读 · 2021年3月16日

Learning to Shape Rewards using a Game of Switching Controls

Learning to Shape Rewards using a Game of Switching Controls

Arxiv

0+阅读 · 2021年3月16日

Model-Based Actor-Critic with Chance Constraint for Stochastic System

Arxiv

0+阅读 · 2021年3月16日

Closed-Loop Error Learning Control for Uncertain Nonlinear Systems With Experimental Validation on a Mobile Robot

Arxiv

0+阅读 · 2021年3月16日

Training a Single Bandit Arm

Arxiv

0+阅读 · 2021年3月15日

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Arxiv

5+阅读 · 2018年7月23日

A survey on policy search algorithms for learning robot controllers in a handful of trials

Arxiv

3+阅读 · 2018年7月6日

Safety-aware Adaptive Reinforcement Learning with Applications to Brushbot Navigation

Arxiv

4+阅读 · 2018年1月29日

VIP会员

文章信息

相关主题

相关VIP内容

【经典书】Python金融大数据分析（Yves Hilpsch 著），566页pdf

【经典书】Python金融大数据分析（Yves Hilpsch 著），566页pdf

专知会员服务

97+阅读 · 2021年1月9日

【MIT】反偏差对比学习，Debiased Contrastive Learning

【MIT】反偏差对比学习，Debiased Contrastive Learning

专知会员服务

91+阅读 · 2020年7月4日

【IJCAI2020】基于生成对抗模仿学习的多模态模仿学习算法框架

【IJCAI2020】基于生成对抗模仿学习的多模态模仿学习算法框架

专知会员服务

58+阅读 · 2020年5月26日

【google】监督对比学习，Supervised Contrastive Learning

【google】监督对比学习，Supervised Contrastive Learning

专知会员服务

32+阅读 · 2020年4月23日

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

基于动态时空图CNNs的交通流预测，Dynamic Spatio-temporal Graph-based CNNs for Traffic Flow Prediction

专知会员服务

136+阅读 · 2020年3月8日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【IJCAI 2019 | tutorial】材料学与AI AI for Materials Science , Lars Kotthof

【IJCAI 2019 | tutorial】材料学与AI AI for Materials Science , Lars Kotthof

专知会员服务

18+阅读 · 2019年8月12日

热门VIP内容

开通专知VIP会员享更多权益服务

《太空边缘（临近空间）的武器化？军事高空平台的进展与前景》

《利用星基增强系统（SBAS）信号进行射频干扰（RFI）检测与特征分析》

美陆军在“艾布拉姆斯”坦克与“布拉德利”步战车上测试“牛蛙”反无人机炮塔

《军事领域特性及其对军事人工智能应用的影响》

相关资讯

最前沿：深度解读Soft Actor-Critic 算法

最前沿：深度解读Soft Actor-Critic 算法

极市平台

55+阅读 · 2019年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

已删除

将门创投

3+阅读 · 2019年4月25日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Machine Learning：十大机器学习算法

Machine Learning：十大机器学习算法

开源中国

21+阅读 · 2018年3月1日

推荐｜深度强化学习聊天机器人（附论文）！

推荐｜深度强化学习聊天机器人（附论文）！

全球人工智能

4+阅读 · 2018年1月30日

深度学习医学图像分析文献集

深度学习医学图像分析文献集

机器学习研究会

19+阅读 · 2017年10月13日

相关论文

Collision Avoidance in Tightly-Constrained Environments without Coordination: a Hierarchical Control Approach

Collision Avoidance in Tightly-Constrained Environments without Coordination: a Hierarchical Control Approach

Arxiv

0+阅读 · 2021年3月18日

An Inextensible Model for Robotic Simulations of Textiles

Arxiv

0+阅读 · 2021年3月17日

Self-Validated Ensemble Models for Design of Experiments

Arxiv

0+阅读 · 2021年3月16日

Learning to Shape Rewards using a Game of Switching Controls

Learning to Shape Rewards using a Game of Switching Controls

Arxiv

0+阅读 · 2021年3月16日

Model-Based Actor-Critic with Chance Constraint for Stochastic System

Arxiv

0+阅读 · 2021年3月16日

Closed-Loop Error Learning Control for Uncertain Nonlinear Systems With Experimental Validation on a Mobile Robot

Arxiv

0+阅读 · 2021年3月16日

Training a Single Bandit Arm

Arxiv

0+阅读 · 2021年3月15日

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Arxiv

5+阅读 · 2018年7月23日

A survey on policy search algorithms for learning robot controllers in a handful of trials

Arxiv

3+阅读 · 2018年7月6日

Safety-aware Adaptive Reinforcement Learning with Applications to Brushbot Navigation

Arxiv

4+阅读 · 2018年1月29日

微信扫码咨询专知VIP会员